Skip to main content
Snorkel AI logo

Snorkel AI

Data-centric AI platform for programmatic data labeling and curation

AI InfrastructureFounded 2019#39 of 54 in AI Infrastructure

Updated May 29, 2026

Follow this company

Follow this company to revisit its latest research cards from your account.

Company profile

What is Snorkel AI?

Snorkel AI provides a data-centric AI platform, Snorkel Flow, that enables enterprises to programmatically label, curate, and manage training data. By focusing on improving data quality and quantity, the platform helps organizations accelerate the development and deployment of AI applications. Its significance lies in addressing the critical bottleneck of data preparation, making AI development more efficient and scalable.

Company research

Data as of May 29, 2026

No company research is published for this company yet. All research →

Latest company data

Primary data point

Downloads

63.3K/30d

Tracked package: PyPI snorkel

▼ -26%Updated 1d ago

Other data points

Dependent projects

15

Projects depending on tracked package: PyPI snorkel

Updated 17h ago

GitHub stars

6.0K

Main repository stars

Updated 17h ago

Code usage

65

Public projects using this company's tools

Updated 17h ago

Repository health

Maintenance data from the main open-source repository.

OpenSSF Scorecard
3.5
Releases (30d)
0

Repository usage

Public repositories and source files importing packages tied to Snorkel AI.

Repos importing
650%

About Snorkel AI

Snorkel AI provides a data-centric AI platform, Snorkel Flow, that enables enterprises to programmatically label, curate, and manage training data. By focusing on improving data quality and quantity, the platform helps organizations accelerate the development and deployment of AI applications. Its significance lies in addressing the critical bottleneck of data preparation, making AI development more efficient and scalable.

FoundersAlex Ratner, Chris Ré, Henry Ehrenberg, Stephen H. Bach

Funding

$138M · 4 rounds

Raised $138M total. Category position #39 of 54 in AI Infrastructure.

Series C2021

Addition, BlackRock

$85M
Series B2021

BlackRock

$35M
Series A2020

GV, Greylock

$15M
Seed2019

GV, Greylock, In-Q-Tel

$3M

Investors

AdditionBlackRockGreylockGVIn-Q-Tel

Tracked packages (1)

1 PyPI

snorkel

PyPIMain PyPI package

snorkel

Data-centric AI platform for programmatic data labeling and curation