CocoIndex

Incremental data framework for fresh AI context.

Categories: Data OpsVector DB
Pricing: FREE
Source: Open source
Platforms: API
Models: BYO key / model
Verified: Jun 19, 2026

CocoIndex is an open-source data transformation framework that keeps AI agents and LLM apps supplied with continuously fresh, structured context. It turns sources like codebases, PDFs, databases, and Slack into vector or graph stores, and reprocesses only what changed (delta-only) with parallel execution by default. A Rust core drives reliability while pipelines are defined declaratively in Python, with end-to-end lineage and an observability UI called CocoInsight.

Pros & cons

Apache-2.0 with a Rust core
Incremental, delta-only processing
Declarative Python pipelines
End-to-end lineage + CocoInsight UI

Younger, smaller ecosystem
Python-centric authoring
Bring your own model/embedding cost

CocoIndex

Docling

Unstructured

LlamaIndex