Skip to main content
← Back to categories

Category read

Audio & Speech

AI-Buzz tracks 12 Audio & Speech companies. Current signals are clearest in package pull, public code usage, and downstream package usage. Current leaders include ElevenLabs, Cartesia, and Play.ht. The current snapshot covers 14.7M package downloads per month and 157 public importing repos.

5 of 12 companies with package pull3 of 12 companies with public code signals3 of 12 companies with distribution signals

Companies

12

Core companies in scope

Package pull

14.7M/30d

5 companies with registry activity

Public code

157

3 companies with imports detected

Dependents

368

4 companies with downstream packages

Funding

$388M

8 companies with disclosed rounds

Lead signals

What shows up first in this category

package pull, public code usage, and downstream package usage are the clearest aggregate signals across Audio & Speech right now.

Package pull

14.7M/30d

5 of 12 companies show registry usage; ElevenLabs currently leads.

Public code

157

3 companies show public imports; ElevenLabs leads.

Dependents

368

4 companies show downstream package usage.

Reading

How the current signals read

package pull and public code are the clearest category-wide signals right now.

Package pull

14.7M/30d

Across 5 of 12 tracked companies

Public code

157

3 companies with imports detected

Dependents

368

Across 4 companies with downstream packages

Funding

$388M

Across 8 companies with disclosed rounds

Key facts

  • ElevenLabs is the current DAI leader at 63.
  • ElevenLabs leads current package pull with 13.9M/30d.
  • ElevenLabs shows the widest public code footprint with 152 importing repositories.
  • Suno has the largest disclosed funding footprint at $125M.

Coverage

  • 5 of 12 companies with package pull
  • 3 of 12 companies with public code imports
  • 4 of 12 companies with downstream package usage
  • 3 of 12 companies with distribution coverage
  • Latest category snapshot synced April 13, 2026.

Signal routes

Where to inspect this category next

Use the category summary to identify where the signal mix is strongest, then open the company pages behind it.

Company routes

Read the strongest company pages next

These cards surface the strongest signal mixes, then link straight into the company pages with the underlying detail.

View all companies →
#1
ElevenLabs logo
ElevenLabs

Best-in-class AI voice synthesis. Clones voices with minimal samples.

Moderate

13.9M/30d package pull, 152 public repos, and 316 dependents.

DAI

63

Package pull

13.9M/30d

Public code

152

Dependents

316

2.9K GitHub stars
Open company page →
#2
Cartesia logo
Cartesia

Real-time voice generation with state space model architecture

Moderate

632.8K/30d package pull, 35 dependents, and $32M funding.

DAI

43

Package pull

632.8K/30d

Dependents

35

Funding

$32M

121 GitHub stars
Open company page →
#3
Play.ht logo
Play.ht

AI voice generation platform with realistic text-to-speech

Moderate

93.6K/30d package pull, 2 public repos, and 14 dependents.

DAI

33

Package pull

93.6K/30d

Public code

2

Dependents

14

220 GitHub stars
Open company page →
#4
Resemble AI logo
Resemble AI

AI voice cloning and speech synthesis platform for developers

Moderate

39.8K/30d package pull, 3 public repos, and 3 dependents.

DAI

30

Package pull

39.8K/30d

Public code

3

Dependents

3

24.3K GitHub stars
Open company page →
#5
Fish Audio logo
Fish Audio

Open-source multilingual text-to-speech and voice cloning platform

Moderate

460/30d package pull.

DAI

17

Package pull

460/30d

Limited repo-health coverage
Open company page →
#6
Krisp logo
Krisp

AI-powered noise cancellation, transcription, and meeting assistant for calls and conferencing.

3/30d HN mentions.

DAI

0

HN

3/30d

Limited repo-health coverage
Open company page →
#7
Descript logo
Descript

All-in-one audio/video editor. Edit video by editing text transcript.

Moderate

$100M funding and 7/30d HN mentions.

DAI

0

Funding

$100M

HN

7/30d

1.8K GitHub stars
Open company page →
#9
LOVO logo
LOVO

AI voiceover platform with library of synthetic voices.

$7M funding.

DAI

0

Funding

$7M

Limited repo-health coverage
Open company page →
#10
Udio logo
Udio

AI music creation platform. Generates songs with multiple styles and genres.

$1K funding and 4/30d HN mentions.

DAI

0

Funding

$1K

HN

4/30d

Limited repo-health coverage
Open company page →

All 12 companies in this category

Comparison table

Compare the full category

Sort by the signal that matters most for this category: combined DAI, package pull, public code adoption, hiring demand, or funding depth.

CompanyDAI ScoreDownloads (30d)Public CodeHiringFunding
ElevenLabs

Best-in-class AI voice synthesis. Clones voices with minimal samples.

6313.9M152-$101M
Cartesia

Real-time voice generation with state space model architecture

43632.8K--$32M
Play.ht

AI voice generation platform with realistic text-to-speech

3393.6K2--
Resemble AI

AI voice cloning and speech synthesis platform for developers

3039.8K3-$12M
Fish Audio

Open-source multilingual text-to-speech and voice cloning platform

17460---
Krisp

AI-powered noise cancellation, transcription, and meeting assistant for calls and conferencing.

-----
Descript

All-in-one audio/video editor. Edit video by editing text transcript.

----$100M
Speechify

Text-to-speech app that reads any text aloud. Accessibility focused.

-----
LOVO

AI voiceover platform with library of synthetic voices.

----$7M
Udio

AI music creation platform. Generates songs with multiple styles and genres.

----$1K
Suno

AI music generation. Creates songs from text prompts with vocals.

----$125M
Murf AI

AI voiceover platform for presentations, videos, and e-learning.

----$12M

FAQ

Common questions

What are the top Audio & Speech AI companies?

The current DAI leaders in Audio & Speech include ElevenLabs (DAI 63), Cartesia (DAI 43), and Play.ht (DAI 33). Use the company pages to inspect the package pull, public code, hiring, and funding signals behind those rankings.

How much public adoption shows up in Audio & Speech?

Across 12 tracked companies, current public signals show 14.7M package downloads per month, 368 known dependents, and 157 public importing repos. Coverage is uneven by company, so the detail pages show which signals are actually present for each one.

How much funding have Audio & Speech AI companies raised?

Audio & Speech AI companies disclose $388M in combined funding across 8 tracked companies with known rounds.

How much developer discussion shows up around Audio & Speech?

The current snapshot tracks 95 Hacker News mentions across 7 companies in Audio & Speech. Discussion is supportive context rather than the lead signal.

Where should I start when reviewing Audio & Speech?

Start with the strongest signal routes: ElevenLabs leads package pull and ElevenLabs leads public code usage. Then compare those companies against the full table to see whether the category is concentrated or broad.

Browse: AI Agents · AI Infrastructure · AI Robotics · AI Search · Audio Generation · Code Generation · Content Writing · Conversational AI · Creative Tools · Customer Support · Developer Tools · Foundation Models · Healthcare · Image Generation · Legal · Other · Productivity · Robotics · Sales & Marketing · Search & Discovery · Social Media AI · Transcription · Translation · Video Editing · Video Generation · All categories