Speech & Audio AI
70 articles
AI Voice Agent
AI Agents, Artificial Intelligence, Conversational AI
AssemblyAI
AI Companies, Voice AI
Audio Classification Models
Deep Learning, Machine Learning
Audio Models
AI Models
Audio-to-Audio Models
AI Models, Music & Audio Generation
AudioCraft
Deep Learning, Generative AI, Meta AI
AudioLM
Google, Music & Audio Generation
Automatic Speech Recognition Models
AI Models
Cartesia
AI Companies, AI Models, Voice AI
CosyVoice
Chinese AI, Voice AI
Deepgram
AI Companies, Natural Language Processing, Voice AI
Deepgram Nova-3
AI Models, Voice AI
Descript
AI Tools & Products, Artificial Intelligence, Video Generation
DolphinGemma
AI for Science, Google
ElevenLabs
AI Companies, Generative AI, Voice AI
ElevenLabs Music
AI Models, Generative AI, Music & Audio Generation
ElevenLabs v3
AI Models, Generative AI
EnCodec
Meta AI
F5-TTS
AI Models, Open Source AI
Fireflies.ai
AI Tools & Products, Artificial Intelligence
GLM-4-Voice
Chinese AI, Voice AI
GPT-Realtime / OpenAI Realtime API
OpenAI, Voice AI
Gladia
AI Companies, Voice AI
HuBERT
Machine Learning
Hume AI
AI Companies, Conversational AI, Voice AI
Hume Octave 2
AI Models, Generative AI
Inworld AI
AI Companies, AI in Gaming, Conversational AI
Kai-Fu Lee
Chinese AI, People, Venture Capital
Krisp AI
AI Tools & Products
LibriSpeech
AI Benchmarks, Natural Language Processing
Lyria
AI Models, Generative AI, Google DeepMind
Massively Multilingual Speech (MMS)
Meta AI, Open Source AI
Moshi
AI Models, Conversational AI, Open Source AI
Murf AI
AI Tools & Products, Voice AI
Music
AI Tools & Products, Generative AI
NVIDIA Canary
AI Models, NVIDIA
NVIDIA Parakeet
NVIDIA, Open Source AI
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers (VALL-E)
Generative AI, Microsoft
OpenAI Realtime API
Conversational AI, Developer Tools, OpenAI
Otter.ai
AI Tools & Products, Artificial Intelligence
PlayHT
AI Companies, Generative AI, Voice AI
Qwen2-Audio
Chinese AI, Multimodal AI
Resemble AI
AI Companies, Generative AI, Voice AI
Rime (company)
AI Companies, Generative AI, Voice AI
SUPERB
AI Benchmarks, Machine Learning
SeamlessM4T
Meta AI, Natural Language Processing
Sesame (AI company)
AI Companies, Open Source AI, Voice AI
Sesame CSM
AI Models, Generative AI, Open Source AI
SoundStream
Speech recognition
Deep Learning, Machine Learning, Natural Language Processing
Speechmatics
AI Companies, Voice AI
SpiRit-LM
Large Language Models, Meta AI
Stable Audio 2.5
AI Models, Generative AI, Music & Audio Generation
Suno
AI Companies, Generative AI, Music & Audio Generation
Suno v5
AI Models, Generative AI, Music & Audio Generation
Superwhisper
AI Tools & Products, Voice AI
Text-to-Speech Models
AI Models
Universal Speech Model
AI Models, Natural Language Processing
Voice Activity Detection Models
AI Models
Voice Engine (OpenAI)
OpenAI, Voice AI