ToolsGazer logoToolsGazer
icon of Hugging Face Datasets - Hacker News

Hugging Face Datasets - Hacker News

Explore the complete Hacker News archive: every story, comment, and more, live-updated every 5 minutes, for comprehensive tech insights.

Introduction

Core Value Proposition

Unlock a wealth of insights from the Hacker News community with our comprehensive dataset. Gain access to every story, comment, Ask HN, Show HN, job posting, and poll since 2006, updated every 5 minutes. This dataset provides unparalleled access to one of the most influential tech communities on the internet.

Key Features
  • Complete Archive: Access all Hacker News content from its inception.
  • Live Updates: Updated every 5 minutes to ensure the most current information.
  • Two Configs: Default (entire dataset) and Today (daily data)
  • Parquet Format: Efficient storage and retrieval of data.
  • Tabular and Textual Data: Supports various analysis types.
Use Cases
  • Sentiment Analysis: Analyze community sentiment towards specific technologies or companies.
  • Trend Identification: Identify emerging trends and topics within the tech industry.
  • Natural Language Processing (NLP) Tasks: Use the dataset for text generation, feature extraction, and text classification.
  • Question Answering: Train models to answer questions based on Hacker News content.
  • Market Research: Gather insights into user preferences and opinions.
  • Building Community Engagement Tools: Build tools which promote increased engagement for communities, companies, or projects.
Data Details
  • Size: 10M < n < 100M
  • Number of Rows: Over 47 million
  • License: odc-by
  • File Format: Parquet

Newsletter

Join the Community

Subscribe to our newsletter for the latest news and updates