How we collect, verify, and publish AI company signals.
Last reviewed: April 2026
AI-Buzz tracks package downloads, code adoption, contributor activity, hiring demand, and funding across AI companies. Most signals sync daily at 6 AM UTC. Funding data is manually verified before publication.
We focus on developer adoption signals — what developers are actually using in production — not just who raised funding. Not every signal movement becomes a published claim. Research Briefs are published only when changes survive anomaly checks.
We don't index every company with an AI landing page. A company qualifies only when there is enough verifiable public evidence to track it consistently.
Package mappings and company identity are reviewed before a profile goes public. Inclusion does not imply endorsement or market leadership — it means we found enough public evidence to track the company consistently.
AI drafts each brief from structured company metrics. A human editor selects the question, verifies the figures, and approves the claim before publication. Each brief carries a disclosure stating this process.
If interpretation changes due to new data, we update the brief with a revision note and new as-of date — the original claim is not silently changed.
| Signal | Source | What's Measured | Frequency |
|---|---|---|---|
| Package Downloads | npm, PyPI | Daily download counts per tool, summed into 30-day totals with month-over-month trends. Multi-package companies aggregate across all tools. | Daily |
| GitHub Activity | GitHub API | Stars, forks, and growth trends | Daily |
| GitHub Contributors | GitHub API | Unique commit authors in last 30 days across all company repos | Daily |
| npm/PyPI Dependents | Libraries.io | Count of packages that depend on company npm/PyPI packages | Weekly |
| Code Adoption | ecosyste.ms | Public repositories importing company packages (dependent repo count) | Daily |
| HN Mentions & Sentiment | HN API | 30-day mention counts, discussion share by category. Sentiment classified by Gemini LLM (positive / neutral / negative, ~80% accuracy). Only shown when sample size >= 25 comments. | Daily |
| Reddit Mentions & Sentiment | Reddit Search API | 30-day mention counts in ML subreddits (r/MachineLearning, r/LocalLLaMA, r/artificial). Sentiment classified by Gemini LLM. | Weekly |
| Job Demand | HN API | Company mentions in monthly HN hiring threads. Skews toward startup/tech roles. | Monthly |
| Docker Hub Pulls | Docker Hub | Container adoption | Weekly |
| Hugging Face Downloads | Hugging Face | ML model adoption | Weekly |
| Hugging Face Models | Hugging Face | ML model portfolio breadth | Weekly |
| Funding | TechCrunch, VentureBeat, Wikipedia | Round size, type, date, lead investors | As announced, manually verified |
| Company News | TechCrunch, VentureBeat, major tech publications | News articles tracked per company from major tech publications | Daily |
Each company gets a confidence score based on profile completeness, identifier verification, metric freshness, and signal coverage. Scores below 40 mean significant data gaps — treat conclusions with caution.
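As a rough illustration of how those four factors could combine into a 0–100 confidence score, here is a minimal sketch. The equal weighting and function names are assumptions for illustration only; the actual scoring formula is not published here.

```python
def profile_confidence(completeness: float,
                       identifiers_verified: bool,
                       freshness: float,
                       coverage: float) -> float:
    """Hypothetical equal-weight combination of the four inputs.

    completeness, freshness, and coverage are assumed to be normalized
    to 0..1; identifier verification is treated as pass/fail. The equal
    weights here are an assumption, not AI-Buzz's actual formula.
    """
    parts = [
        completeness,
        1.0 if identifiers_verified else 0.0,
        freshness,
        coverage,
    ]
    return round(sum(parts) / len(parts) * 100, 1)
```

Under this sketch, a profile with full completeness, verified identifiers, fresh metrics, and full signal coverage scores 100; a score below 40 would correspond to several weak or missing inputs.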
Funding rounds are detected from news feeds and cross-referenced against company announcements. Every round is manually verified before appearing on a profile.
Factual errors are corrected visibly. Research Brief interpretation changes get a revision note and new as-of date. Use the "Report an error" button on any company profile, or email nick@ai-buzz.com.
AI-Buzz data is citable when the page URL and retrieval date are preserved.
Composite score (0–100) combining 4 adoption signals. All inputs are external and independently verifiable. See the DAI page for the leaderboard.
DAI = Σ (weight_i × normalize(signal_i)) × 100

| Component | Weight | Source |
|---|---|---|
| Package Downloads | 40% | npm, PyPI (30-day totals) |
| Download Growth | 25% | Month-over-month trend |
| Dependents | 20% | Libraries.io, npm registry |
| GitHub Contributors | 15% | Unique contributors in 30 days |
Each signal is normalized using percentile ranking across non-zero values. Download growth uses trend normalization (-200% → 0, 0% → 0.5, +200% → 1.0). Scores recompute daily after syncs; computation aborts if >10% of companies have stale data.
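The normalization and weighting described above can be sketched as follows. This is a minimal illustration, assuming the stated weights and the linear trend mapping (−200% → 0, 0% → 0.5, +200% → 1.0); the function and field names are hypothetical, not AI-Buzz's actual code.

```python
from bisect import bisect_left

# Stated v3 DAI weights (from the table above).
WEIGHTS = {"downloads": 0.40, "growth": 0.25, "dependents": 0.20, "contributors": 0.15}

def percentile_rank(value: float, population: list[float]) -> float:
    """Percentile of `value` among the non-zero values in `population` (0..1)."""
    nonzero = sorted(v for v in population if v > 0)
    if not nonzero or value <= 0:
        return 0.0
    return bisect_left(nonzero, value) / len(nonzero)

def trend_norm(pct_change: float) -> float:
    """Linear map of month-over-month growth: -200% -> 0.0, 0% -> 0.5, +200% -> 1.0."""
    return min(1.0, max(0.0, (pct_change + 200.0) / 400.0))

def dai(company: dict, cohort: dict[str, list[float]]) -> float:
    """Weighted sum of normalized signals, scaled to 0-100."""
    score = (
        WEIGHTS["downloads"] * percentile_rank(company["downloads"], cohort["downloads"])
        + WEIGHTS["growth"] * trend_norm(company["growth_pct"])
        + WEIGHTS["dependents"] * percentile_rank(company["dependents"], cohort["dependents"])
        + WEIGHTS["contributors"] * percentile_rank(company["contributors"], cohort["contributors"])
    )
    return round(score * 100, 1)
```

A company at the top of its cohort on every percentile-ranked signal, with flat download growth, would land at 0.75 × top-percentile + 0.25 × 0.5 rather than a perfect 100 — growth has to be positive to max out the score.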
Measures how quickly a company is gaining or losing developer traction (0–100). The score combines six directional components; v2 weights have been in effect since March 2026:
| Component | Weight |
|---|---|
| HN mentions trend | 20% |
| GitHub stars trend | 20% |
| Download trend | 20% |
| Funding recency | 15% |
| Code adoption | 15% |
| Job demand | 10% |
When components are missing, available weights redistribute proportionally. A confidence value records the fraction present (1.0 = all 6).
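The proportional redistribution and confidence calculation can be sketched like this. The weights come from the table above; the function name and input shape are illustrative assumptions, not the production implementation.

```python
# Stated v2 momentum weights (from the table above).
V2_WEIGHTS = {
    "hn_mentions_trend": 0.20,
    "github_stars_trend": 0.20,
    "download_trend": 0.20,
    "funding_recency": 0.15,
    "code_adoption": 0.15,
    "job_demand": 0.10,
}

def momentum(components: dict[str, float]) -> tuple[float, float]:
    """Weighted 0-100 score over normalized components (each 0..1).

    Missing components (absent or None) drop out, and the remaining
    weights are rescaled to sum to 1. Confidence is the fraction of
    the six components that are present (1.0 = all six).
    """
    present = {k: w for k, w in V2_WEIGHTS.items() if components.get(k) is not None}
    if not present:
        return 0.0, 0.0
    total_w = sum(present.values())
    score = sum((w / total_w) * components[k] for k, w in present.items()) * 100
    confidence = len(present) / len(V2_WEIGHTS)
    return round(score, 1), round(confidence, 2)
```

For example, if only the three 20%-weight trend components are available, each effectively counts for a third of the score and the confidence value is 0.5.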
Weight changes are also logged to the public changelog feed.
Simplified from 9 to 4 verified adoption signals; removed sentiment proxies and lagging indicators. Renamed to Developer Adoption Index (DAI).
Removed self-referential signals (page views, GSC impressions). All 9 signals external.
Added demand-side signals. Later removed in v3.0 due to circularity.
Original signal score with equal weighting across HN mentions, GitHub stars, funding, and downloads.