Vector DB · NeuML

txtai

All-in-one open-source embeddings database for semantic search and RAG.

Category: Vector DB
Pricing: FREE
Hosting: Self-host
Platforms: API, CLI
Models: Model-agnostic
Verified: Jun 13, 2026

txtai is an open-source embeddings database that unifies dense and sparse vector indexes, graph networks, and a relational database into a single store for semantic search and LLM workflows. Beyond vector search, it ships retrieval-augmented generation (RAG), pipelines, and agents, and applications can be built in Python or declarative YAML. It runs locally or self-hosted and exposes API bindings for JavaScript, Java, Rust, and Go.
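As a rough illustration of the Python route, the sketch below indexes a few strings and runs a semantic query. It assumes txtai is installed (`pip install txtai`). The sample documents, the `semantic_search` helper name, and the choice of the `sentence-transformers/all-MiniLM-L6-v2` model are illustrative assumptions, not recommendations from this listing.

```python
# Hypothetical sample data; any list of strings would do.
DOCUMENTS = [
    "US tops 5 million confirmed virus cases",
    "Canada's last fully intact ice shelf has suddenly collapsed",
    "Maine man wins $1M from $25 lottery ticket",
    "Make huge profits without work, earn up to $100,000 a day",
]


def semantic_search(query, limit=1):
    """Index DOCUMENTS with txtai and return the top matches for query.

    Illustrative helper, not part of txtai itself.
    """
    # Imported lazily so this module still loads where txtai is missing.
    from txtai import Embeddings

    # content=True stores the original text next to the vectors, so
    # results come back as dicts with "id", "text" and "score" keys.
    embeddings = Embeddings(
        path="sentence-transformers/all-MiniLM-L6-v2",  # assumed model choice
        content=True,
    )

    # Build the index from the plain list of strings.
    embeddings.index(DOCUMENTS)

    # Return the `limit` best matches for the natural-language query.
    return embeddings.search(query, limit)
```

Calling `semantic_search("feel good story")` downloads the model on first use and returns a list of result dicts. The index here lives in memory and is rebuilt on every call, which keeps the sketch short but is not how a real application would be structured.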

Pros & cons

Pros

  • Fully open source (Apache-2.0), runs locally
  • Vector + graph + SQL in one store
  • Build with Python or YAML
  • API bindings for JS, Java, Rust, Go
  • Built-in RAG, agents, and pipelines

Cons

  • Maintained by a small team, not a big vendor
  • Smaller ecosystem than Pinecone/Weaviate
  • No managed cloud offering
  • More concepts than a plain vector DB

View all Vector DB
  • Chroma
    Vector DB · FREEMIUM · Open core

    Embedded vector DB. Pip-install, prototype, scale later.

    The low-friction starting point — Chroma runs embedded inside your Python process or as a hosted service. Great for prototypes and small-to-medium RAG apps; upgrade to a managed option when you outgrow it.

    Worth knowing

    Raised an $18M seed (Quiet Capital, 2023) with angels including Naval Ravikant, Guillermo Rauch, and Amjad Masad.

    • open-source
    • embedded
    • prototype
    • python
  • LanceDB
    Vector DB · FREEMIUM · Open core

    Embedded multimodal vector database on the Lance format.

    An open-source retrieval engine for AI built on the Lance columnar format. It runs in-process alongside your app — no separate server — and stores, indexes, and searches vectors, metadata, and multimodal data (text, images, video) with vector, full-text, and SQL queries. A managed enterprise lakehouse tier scales the same engine to petabytes.

    Worth knowing

    Its CEO co-authored the pandas library; the YC-backed startup counts Midjourney as a customer.

    • vector-search
    • multimodal
    • embedded
    • lance
  • Weaviate
    Vector DB · FREEMIUM · Open core

    Open-source vector database with built-in vectorisers.

    Cloud-native vector DB that can compute embeddings inline — pass raw text in, store vectors out. Strong hybrid (BM25 + vector) search; self-hostable or managed.

    Worth knowing

    Founded in Amsterdam in 2019 under the name SeMI Technologies; raised a $50M Series B led by Index Ventures in 2023.

    • open-source
    • self-hosted
    • hybrid-search
    • rag
  • Qdrant
    Vector DB · FREEMIUM · Open core

    Open-source, Rust-based vector DB. Fast, predictable, self-hostable.

    Vector database written in Rust with a strong focus on filtering, payloads, and predictable latency at scale. Self-host as a single binary or use the managed cloud.

    Worth knowing

    Berlin-based, founded 2021; raised a $28M Series A led by Spark Capital in January 2024.

    • open-source
    • rust
    • self-hosted
    • fast