AI Wiki
Category

Speech & Audio AI

70 articles

AI Voice Agent

AI Agents, Artificial Intelligence, Conversational AI

AssemblyAI

AI Companies, Voice AI

Audio Classification Models

Deep Learning, Machine Learning

Audio Models

AI Models

Audio-to-Audio Models

AI Models, Music & Audio Generation

AudioCraft

Deep Learning, Generative AI, Meta AI

AudioLM

Google, Music & Audio Generation

Automatic Speech Recognition Models

AI Models

Cartesia

AI Companies, AI Models, Voice AI

CosyVoice

Chinese AI, Voice AI

Deepgram

AI Companies, Natural Language Processing, Voice AI

Deepgram Nova-3

AI Models, Voice AI

Descript

AI Tools & Products, Artificial Intelligence, Video Generation

DolphinGemma

AI for Science, Google

ElevenLabs

AI Companies, Generative AI, Voice AI

ElevenLabs Music

AI Models, Generative AI, Music & Audio Generation

ElevenLabs v3

AI Models, Generative AI

EnCodec

Meta AI

F5-TTS

AI Models, Open Source AI

Fireflies.ai

AI Tools & Products, Artificial Intelligence

GLM-4-Voice

Chinese AI, Voice AI

GPT-Realtime / OpenAI Realtime API

OpenAI, Voice AI

Gladia

AI Companies, Voice AI

HuBERT

Machine Learning

Hume AI

AI Companies, Conversational AI, Voice AI

Hume Octave 2

AI Models, Generative AI

Inworld AI

AI Companies, AI in Gaming, Conversational AI

Kai-Fu Lee

Chinese AI, People, Venture Capital

Krisp AI

AI Tools & Products

LibriSpeech

AI Benchmarks, Natural Language Processing

Lyria

AI Models, Generative AI, Google DeepMind

Massively Multilingual Speech (MMS)

Meta AI, Open Source AI

Moshi

AI Models, Conversational AI, Open Source AI

Murf AI

AI Tools & Products, Voice AI

Music

AI Tools & Products, Generative AI

NVIDIA Canary

AI Models, NVIDIA

NVIDIA Parakeet

NVIDIA, Open Source AI

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers (VALL-E)

Generative AI, Microsoft

OpenAI Realtime API

Conversational AI, Developer Tools, OpenAI

Otter.ai

AI Tools & Products, Artificial Intelligence

PlayHT

AI Companies, Generative AI, Voice AI

Qwen2-Audio

Chinese AI, Multimodal AI

Resemble AI

AI Companies, Generative AI, Voice AI

Rime (company)

AI Companies, Generative AI, Voice AI

SUPERB

AI Benchmarks, Machine Learning

SeamlessM4T

Meta AI, Natural Language Processing

Sesame (AI company)

AI Companies, Open Source AI, Voice AI

Sesame CSM

AI Models, Generative AI, Open Source AI

SoundStream

Google

Speech recognition

Deep Learning, Machine Learning, Natural Language Processing

Speechmatics

AI Companies, Voice AI

SpiRit-LM

Large Language Models, Meta AI

Stable Audio 2.5

AI Models, Generative AI, Music & Audio Generation

Suno

AI Companies, Generative AI, Music & Audio Generation

Suno v5

AI Models, Generative AI, Music & Audio Generation

Superwhisper

AI Tools & Products, Voice AI

Text-to-Speech Models

AI Models

Universal Speech Model

AI Models, Natural Language Processing

Voice Activity Detection Models

AI Models

Voice Engine (OpenAI)

OpenAI, Voice AI