Murf AI

AI voice generator studio with dubbing and a low-latency TTS API.

Category: Voice
Pricing: FREEMIUM
Source: Proprietary
Hosting: Cloud
Platforms: WebAPI
Models: Self-contained (on-device)
Verified: Jun 11, 2026

Murf AI is a text-to-speech platform pairing a studio editor — 200+ voices across 35+ languages, voice cloning, dubbing, and a voice changer — with developer APIs. Its Gen 2 speech model focuses on pronunciation accuracy and granular voice controls, while the Falcon API targets sub-130ms latency for real-time voice agents. Integrations include Canva, PowerPoint, and Google Slides.

Capabilities 3

What it actually does — grouped by capability family.

Speech synthesis (TTS) (primary capability)
Voice cloning (secondary capability)
Dubbing (secondary capability)

Pros & cons

200+ voices, 35+ languages
Canva/Slides/PowerPoint integrations
Low-latency Falcon API
Free plan to test

Commercial rights need a paid plan
Voice cloning gated to higher tiers
Less expressive than newer voice models

View ElevenLabs details
VoiceFREEMIUM
ElevenLabs
ElevenLabs
Text-to-speech, voice cloning, and multilingual dubbing.
Hosted speech synthesis at near-human quality — TTS, voice cloning, multilingual dubbing, and conversational voice agents. Default choice when you need a voice that sounds like a person, not a robot.
Best-in-class voice realism
Pricier than commodity TTS at scale
- tts
- voice-cloning
- dubbing
- multilingual
Open
View Speechify details
VoiceFREEMIUM
Speechify
Speechify
AI text-to-speech that reads any document, PDF, or page aloud.
Speechify is an AI text-to-speech app that turns articles, PDFs, emails, and books into natural-sounding audio with high-definition voices, adjustable speed, and OCR for scanned text. It runs on iOS, Android, web, a browser extension, and desktop, and offers a separate Studio product plus a text-to-speech API for developers.
OCR reads scanned text and PDFs
Best features behind paywall
- text-to-speech
- read-aloud
- accessibility
- voice-cloning
Open
View Cartesia details
VoiceFREEMIUM
Cartesia
Cartesia
Low-latency streaming text-to-speech for real-time voice.
Streaming-first speech synthesis built around the Sonic family of state-space models. Aims at real-time agent voices where latency between turns is the product. Strong choice for sub-200ms voice loops.
Streaming over WebSocket for fast first audio
Long-form expressive texture trails ElevenLabs
- tts
- streaming
- low-latency
- real-time
Open

Open Murf AI

Murf AI

Capabilities 3

Pros & cons

Tags

Further reading

ElevenLabs

Speechify

Cartesia