Strongest current signal
Package downloads
188.3M/30d
Tracked on PyPI
Data and AI platform, creator of MLflow and Dolly
Best current coverage: 188.3M downloads/30d, 240 dependents, and 93 public repos.
Lead signals
Package pull and public code usage both show up clearly for Databricks.
240
Known dependents on PyPI
540
Main repository stars
30/30d
Ranked #14 in category discussion
Research Brief
No recent Research Brief centers Databricks yet. Start with the latest market reporting, then return here for the stored company signals on this page.
Reading
Package pull and existing public code are the clearest current signals for Databricks.
Package pull
188.3M/30d
Tracked package pull on PyPI
Existing public code
93
Public repositories importing tracked packages
Downstream usage
240
Known dependents on PyPI
GitHub attention
540
Main repository stars
Coverage includes npm and PyPI registries, GitHub, public code import detection, developer discussion, distribution surfaces, and recent company news where available. As of April 13, 2026.
Sustainability and maintenance signals from the primary public repository.
These signals come from public code import detection and tracked hiring posts rather than registry totals alone.
Public repositories and source files importing packages tied to Databricks.
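The page does not describe how import detection is implemented. As a rough illustration only, the sketch below shows one way a Python source file could be checked for imports of tracked packages using the standard-library `ast` module. The `TRACKED_PACKAGES` set and the function name `imported_tracked_packages` are hypothetical and are not taken from the tracker.

```python
from __future__ import annotations

import ast

# Hypothetical list of top-level import names tied to a company.
# The real tracked-package list is not given on this page.
TRACKED_PACKAGES = {"mlflow", "databricks", "delta"}


def imported_tracked_packages(source: str, tracked: set[str] = TRACKED_PACKAGES) -> set[str]:
    """Return the tracked top-level packages imported by a Python source string."""
    tree = ast.parse(source)
    found: set[str] = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            # e.g. "import mlflow.sklearn" yields "mlflow.sklearn"
            names = [alias.name for alias in node.names]
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            # Absolute "from package.sub import name"; relative imports are skipped.
            names = [node.module]
        else:
            continue
        for name in names:
            top_level = name.split(".")[0]
            if top_level in tracked:
                found.add(top_level)
    return found


sample = (
    "import os\n"
    "import mlflow\n"
    "from databricks.sdk import WorkspaceClient\n"
    "from . import local_module\n"
)
print(sorted(imported_tracked_packages(sample)))
```

Parsing the syntax tree rather than searching text avoids counting package names that only appear in comments or strings. A count of repositories would then be the number of repositories with at least one file returning a non-empty set.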
Background and reference details
Databricks provides a unified Lakehouse Platform that combines data warehousing and data lake capabilities, enabling organizations to manage and process all their data for analytics and AI workloads. The company is significant in the AI ecosystem for its contributions to open-source technologies such as Apache Spark, Delta Lake, and MLflow, and for developing foundation models such as Dolly.
Raised $20.0B total. DAI rank #23 (top 9%) suggests strong developer adoption relative to funding.
Nvidia, T. Rowe Price
Counterpoint Global
Franklin Templeton
Andreessen Horowitz
Andreessen Horowitz
Andreessen Horowitz
SineWave Ventures
New Enterprise Associates
Andreessen Horowitz
2.5M PyPI (50% of company total)
Lakehouse Platform: A unified platform for data, analytics, and AI workloads, combining the best aspects of data lakes and data warehouses.
Delta Lake: An open-source storage layer that brings ACID transactions, scalable metadata handling, and unified streaming and batch data processing to data lakes.
Dolly: An open-source, instruction-following large language model (LLM) fine-tuned on a human-generated instruction dataset.
MLflow: An open-source platform for managing the end-to-end machine learning lifecycle, including experimentation, reproducibility, and deployment.
Unity Catalog: A unified governance solution for data and AI on the Lakehouse Platform, providing centralized access control, auditing, and lineage.
Public pricing snapshots collected for Databricks
Source: Company pricing page
Updates: Weekly
Note: Extracted via automated page analysis; verify on source
Historical metrics for Databricks
Databricks: PyPI Downloads down 90% (25.6M to 2.5M). GitHub Stars up 2% (529 to 540). HN Mentions down 90% (10 to 1).
| Date | PyPI Downloads | GitHub Stars | HN Mentions |
|---|---|---|---|
| Mar 15, 2026 | 25.6M | 529 | 10 |
| Mar 22, 2026 | 26.3M | 529 | 3 |
| Mar 29, 2026 | 27.1M | 535 | 8 |
| Apr 5, 2026 | 27.4M | 537 | 10 |
| Apr 12, 2026 | 15.0M | 539 | 3 |
| Apr 13, 2026 | 2.5M | 540 | 1 |
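The percentage changes in the summary line can be reproduced from the first and last rows of the table. The sketch below does that arithmetic; the helper names `parse_count` and `percent_change` are hypothetical and are not part of the tracker. The last two rows are dated one day apart while the earlier rows are a week apart, so the final row may cover a shorter collection window. That is an assumption, and the sketch also prints the change up to Apr 5, 2026 for comparison.

```python
# Rows copied from the table above: (date, PyPI downloads, GitHub stars, HN mentions).
ROWS = [
    ("Mar 15, 2026", "25.6M", "529", "10"),
    ("Mar 22, 2026", "26.3M", "529", "3"),
    ("Mar 29, 2026", "27.1M", "535", "8"),
    ("Apr 5, 2026", "27.4M", "537", "10"),
    ("Apr 12, 2026", "15.0M", "539", "3"),
    ("Apr 13, 2026", "2.5M", "540", "1"),
]

SUFFIXES = {"K": 1e3, "M": 1e6, "B": 1e9}


def parse_count(text: str) -> float:
    """Convert a display value such as '25.6M' or '529' to a number."""
    text = text.strip()
    suffix = text[-1].upper()
    if suffix in SUFFIXES:
        return float(text[:-1]) * SUFFIXES[suffix]
    return float(text)


def percent_change(first: float, last: float) -> float:
    """Relative change from first to last, in percent."""
    return (last - first) / first * 100.0


def column_change(rows, column: int) -> int:
    """Rounded percent change between the first and last row for one column."""
    first = parse_count(rows[0][column])
    last = parse_count(rows[-1][column])
    return round(percent_change(first, last))


# First row to last row, matching the summary line.
print(column_change(ROWS, 1))  # PyPI downloads: -90
print(column_change(ROWS, 2))  # GitHub stars: 2
print(column_change(ROWS, 3))  # HN mentions: -90

# Same calculation restricted to the evenly spaced weekly rows (through Apr 5).
print(column_change(ROWS[:4], 1))  # PyPI downloads: 7
```

If the final row is in fact a partial window, the change through Apr 5 is likely the fairer comparison for downloads.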