EvalHamming

Hamming

Automated testing and monitoring for voice and chat agents.

Categories: EvalObservability
Pricing: PAID
Source: Proprietary
Hosting: Cloud
Platforms: WebAPI
Models: Model-agnostic
Verified: Jun 19, 2026

Hamming is an enterprise platform for testing and monitoring conversational AI agents. It auto-generates test scenarios from an agent's prompt, load-tests with tens of thousands of concurrent calls, replays production calls for regression testing, and scores 50+ audio-native metrics like latency, hallucinations, sentiment, and compliance. It integrates natively with Vapi, Retell, ElevenLabs, LiveKit, and Pipecat.

Pros & cons

Audio-native eval (~95% human agreement)
Load-test 50K+ concurrent calls
Production call replay and regression
Integrates Vapi, Retell, LiveKit, Pipecat
SOC 2 Type II, HIPAA-ready

No public pricing or free tier
Focused on voice/chat agents
Newer company

Hamming

Coval

Maxim AI

LangWatch

Confident AI