Skip to main content
AI Buzz logo
Unstructured logo, AI Infrastructure AI company

Unstructured

ETL for unstructured data preprocessing

Founded by: Brian Raymond, Matthew Harrison

DMI: 14#95 of 262HN 30d: 72
14,112 · 1,182 forks
AI InfrastructureMethodologyVisit Website →
Share:
Limited data·Missing: npm downloads, npm dependentsAbout our data

Developer Adoption

Package downloads and ecosystem metrics, 30-day window

PyPI Downloads (30d)
4.7M +9%
unstructured
Active Contributors (30d)
10
Unique committers, 30-day window

Active Contributors Trend

Insufficient data for trend chart

Need at least 2 data points (currently have 1)

PyPI Download Trend

2,075,7562,783,5123,491,2684,199,0234,906,779Dec 10Jan 7Jan 29Feb 12Feb 22Feb 26
90-day trend↑ +30.0%

pypi downloads grew from 3,626,760 to 4,716,358 (+30.0%) over the last 90 days.

Developer Community Metrics

Metrics computed from HN discussion, GitHub activity, and funding data.

HN Discussion Rank#8 in AI Infrastructure by HN mentions (out of 75 tracked)
HN vs Category36.0x category median (72 vs 2 median)
GitHub VelocityCommit velocity paused (0.0x vs 4-week average)
Funding Trajectory3 rounds in 12 months, ($25M → $40M → $40M)
Community Sentiment34% positive HN sentiment (71 comments) • High confidence

Category Benchmark

75 AI Infrastructure companies
4.7M downloads/mo274% above median
0Median: 1.3M2.5M+

Funding Rounds

Series B$40M

2024

Investors:

Series A$40M

2024

Investors:

Seed$25M

2023

Investors:

Cite This Data

According to AI-Buzz, Unstructured ranks #8 in AI Infrastructure for HN discussion share (out of 75 tracked with 4.4% of HN discussion), with 34% positive developer sentiment (71 HN comments analyzed), with 4,716,358 PyPI downloads in 30 days, with 10 active contributors in 30 days.

Source: https://www.ai-buzz.com/companies/unstructured?utm_source=citation&utm_medium=referral&utm_campaign=cite_this_data

Metrics derived from public APIs (HN Algolia, GitHub, npm/PyPI). Sentiment classified by AI. See methodology for details →

About

Description

ETL for unstructured data preprocessing

Estimated Company Size

100 - 250 employees

Details

Founded

2022

Description

Unstructured specializes in extracting and transforming complex unstructured data from diverse sources like PDFs, images, and emails into clean, structured formats. Their tools are critical for preparing high-quality input data for large language models (LLMs) and other AI applications, enabling organizations to build more accurate and effective AI solutions.

Open Source

14.1K
Stars
1.2K
Forks
ActiveHTML
Last commit: 1 day ago
View on GitHub →

Embed Badge

Unstructured DMI badge

Weekly AI Intelligence

Which AI companies are developers actually adopting? We track npm and PyPI downloads for 262+ companies. Get the biggest shifts weekly — before they show up in the news.