Cartesia
Cartesia
Low-latency streaming TTS. Sub-100ms first audio.
Streaming-first speech synthesis built around the Sonic family of state-space models. Aims at real-time agent voices where latency between turns is the product. Strong choice for sub-200ms voice loops.
Worth knowing
Founded in 2023 by the Stanford AI Lab team behind state-space models and Mamba, incl. Albert Gu and Karan Goel.
- tts
- streaming
- low-latency
- real-time