LMNT

Fast, lifelike, affordable text-to-speech with low-latency streaming and voice cloning.

Category: Voice
Pricing: Freemium
Hosting: Cloud
Platforms: Web, API
Models: Self-contained (on-device)
Verified: Jun 16, 2026

LMNT is an AI text-to-speech platform that turns text into natural speech with ultra-low latency, built for conversational agents, games, and real-time apps. It supports instant voice cloning from a short sample and multilingual synthesis, and is exposed as a developer API plus a web playground. It is offered as a built-in voice provider across major voice-agent frameworks.

Pros & cons

Pros

  • Low-latency streaming (~150–200 ms)
  • Voice cloning from a 5-second sample
  • Free tier plus affordable paid plans
  • Integrates with major voice-agent stacks
  • Commercial license on paid tiers

Cons

  • Smaller voice library than ElevenLabs
  • Quality trails top expressive TTS models
  • Less brand recognition than incumbents

View all Voice
  • Cartesia
    Voice, Freemium

    Low-latency streaming TTS. Sub-100ms first audio.

    Streaming-first speech synthesis built around the Sonic family of state-space models. Aims at real-time agent voices where latency between turns is the product. Strong choice for sub-200ms voice loops.

    Worth knowing

    Founded in 2023 by the Stanford AI Lab team behind state-space models and Mamba, including Albert Gu and Karan Goel.

    • tts
    • streaming
    • low-latency
    • real-time
  • Rime
    Voice, Freemium

    Enterprise text-to-speech built for real-time voice agents.

    Rime builds AI voice models for high-stakes business conversations like IVRs, contact centers, and AI phone agents. Its Arcana and Mist models target ultra-low latency and natural, conversational delivery, with deterministic pronunciation control so terms are spoken consistently without retraining. Rime can be deployed on-prem, in a VPC, or via cloud API, and is offered directly or through voice-AI partner platforms.

    Worth knowing

    Open-sourced Rimecaster in 2025, billed as the first open speaker model trained on natural conversational speech rather than audiobook speech.

    • text-to-speech
    • voice-ai
    • tts
    • contact-center
  • ElevenLabs
    Voice, Freemium

    Frontier TTS, voice cloning, and dubbing. Industry default.

    Hosted speech synthesis at near-human quality: TTS, voice cloning, multilingual dubbing, and conversational voice agents. Default choice when you need a voice that sounds like a person, not a robot.

    Worth knowing

    Founded in 2022 by two Polish friends (ex-Google and ex-Palantir); a 2026 raise valued it at $11B.

    • tts
    • voice-cloning
    • dubbing
    • multilingual
  • Neuphonic
    Voice, Freemium, Open core

    Ultra-low-latency text-to-speech that runs on-device.

    Neuphonic is a voice-AI company building text-to-speech and voice cloning that run locally with very low latency. Its cloud API targets real-time voice agents, and in October 2025 it open-sourced NeuTTS Air, a 748M-parameter speech language model that runs on CPU via llama.cpp and clones a voice from a few seconds of audio. Aimed at private, offline, and voice-agent use cases.

    Worth knowing

    Open-sourced NeuTTS Air in Oct 2025, an Apache-2.0 748M-param TTS model that runs on CPU and clones a voice from ~3 seconds of audio.

    • text-to-speech
    • voice-cloning
    • on-device
    • open-source