AgentOps

Observability and tracing built for AI agents.

Pricing: Freemium
Source: Open core
Hosting: Hybrid
Platforms: Web, API
Models: Model-agnostic
Verified: Jun 15, 2026

AgentOps is a developer platform for monitoring, debugging, and evaluating AI agents. It records every LLM call, tool use, and decision in a replayable session trace, with time-travel debugging, token and cost tracking, and agent benchmarking. Its open-source SDK drops into Python or TypeScript agents in two lines and integrates with frameworks like CrewAI, LangChain, AutoGen, and the OpenAI Agents SDK.
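
To make the idea of a replayable session trace with token and cost tracking concrete, here is a minimal, self-contained Python sketch. It does not use or reproduce the AgentOps SDK: the `Event` and `SessionTrace` classes, the event names, and the per-token prices are all hypothetical and exist only for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical prices in USD per 1,000 tokens; real rates vary by model.
PROMPT_PRICE_PER_1K = 0.0005
COMPLETION_PRICE_PER_1K = 0.0015


@dataclass
class Event:
    kind: str                   # "llm" or "tool"
    name: str                   # model or tool name (illustrative)
    prompt_tokens: int = 0
    completion_tokens: int = 0


@dataclass
class SessionTrace:
    events: List[Event] = field(default_factory=list)

    def record(self, kind: str, name: str,
               prompt_tokens: int = 0, completion_tokens: int = 0) -> None:
        """Append one step of the agent run to the trace."""
        self.events.append(Event(kind, name, prompt_tokens, completion_tokens))

    def total_tokens(self) -> int:
        """Sum prompt and completion tokens across all recorded events."""
        return sum(e.prompt_tokens + e.completion_tokens for e in self.events)

    def total_cost(self) -> float:
        """Estimate the cost of the run in USD from the assumed prices."""
        prompt = sum(e.prompt_tokens for e in self.events)
        completion = sum(e.completion_tokens for e in self.events)
        return (prompt * PROMPT_PRICE_PER_1K
                + completion * COMPLETION_PRICE_PER_1K) / 1000

    def replay(self, upto: Optional[int] = None) -> List[Event]:
        """Return the first `upto` events: the run as it stood at that step."""
        return self.events[:upto]


# Record a short, made-up agent run.
trace = SessionTrace()
trace.record("llm", "planner-model", prompt_tokens=1000, completion_tokens=500)
trace.record("tool", "web_search")
trace.record("llm", "planner-model", prompt_tokens=400, completion_tokens=100)
```

Calling `trace.replay(2)` returns the run as it looked after the second step, which is the basic idea behind stepping backwards through a session; a hosted platform would add persistence, timestamps, and a UI on top of something like this.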

Pros & cons

  Pros
  • Open-source MIT SDK, two-line setup
  • 400+ LLM and framework integrations
  • Session replay with time-travel debugging
  • Token and cost tracking per run
  • Free tier to start

  Cons
  • Python/TypeScript SDK-centric
  • Full analytics rely on the hosted dashboard
  • Younger than general-purpose APM tools

View all Observability
  • Langfuse
    Observability, Freemium, Open core

    Open-source LLM observability. Self-hostable, OpenTelemetry-native.

    Tracing, evals, prompt management, and dataset tooling for LLM apps — self-host on your own infra or use Langfuse Cloud. The open-source default when you want full ownership of your observability stack.

    Worth knowing

    Y Combinator W23 startup; acquired by ClickHouse in January 2026.

    • open-source
    • tracing
    • evals
    • self-hosted
  • Arize Phoenix (Arize AI)
    Observability, Freemium

    LLM tracing + evaluation. Strong on retrieval debugging.

    Phoenix is Arize's observability platform — run locally in a notebook or as a hosted service. Especially strong for inspecting RAG pipelines, finding bad chunks, and tracking retrieval quality over time.

    Worth knowing

    Licensed under Elastic License 2.0 (source-available), not OSI open-source — despite its open GitHub repo.

    • tracing
    • rag
    • retrieval-debugging
  • Helicone
    Observability, Freemium, Open core

    Drop-in LLM proxy with logging, caching, and cost tracking.

    One-line integration — change your OpenAI/Anthropic base URL and get a dashboard with every prompt, response, latency, and dollar tracked. Adds caching and rate-limit handling without code changes.

    Worth knowing

    YC W23 startup acquired by docs platform Mintlify in March 2026, having processed over 14 trillion tokens for 16,000+ orgs.

    • proxy
    • logging
    • caching
    • cost-tracking
  • Traceloop
    Observability, Freemium, Open core

    LLM observability built on OpenTelemetry.

    A reliability platform for LLM apps: its open-source OpenLLMetry SDK instruments LLM, vector-DB, and framework calls as standard OpenTelemetry spans, which Traceloop's hosted dashboard turns into traces, cost/latency analytics, and quality monitoring. Because the data is plain OTel, you can pipe it to existing observability stacks instead of a proprietary one.

    Worth knowing

    A Y Combinator (W23) startup behind OpenLLMetry; acquired by ServiceNow in 2026.

    • observability
    • opentelemetry
    • tracing
    • open-source