Loading…
Memory · Graphlit
One API for AI agent memory: ingest, extract, store, retrieve.
A cloud-native platform that gives AI agents semantic memory and operational context through a single API. It ingests documents, audio, video, and web pages, extracts entities and relationships, and handles storage and retrieval so you don't assemble the RAG pipeline yourself. Integrates with frontier models from OpenAI, Anthropic, and Google for extraction.
Model support
Where it runs
Tags
Related in Memory
Pieces
On-device AI memory and copilot that recalls your work context.
Pieces is a developer-focused AI tool that automatically captures your work context — code snippets, docs, chats, and links — across your apps and surfaces it through a copilot. Its Long-Term Memory engine records a rolling window of activity at the OS level, enabling time-based questions about what you were doing. It runs on-device and air-gapped from the cloud by default, with optional cloud LLMs, and plugs into desktop, VS Code, JetBrains, and the browser.
AI insight: Its on-device Long-Term Memory captures a rolling ~9 months of work context, so the copilot can answer "what was I working on last week?"
Cognee
Open-source memory for AI agents.
An open-source semantic memory layer for AI agents. Cognee ingests documents, relational data, and system context, then runs an Extract-Cognify-Load pipeline that uses an LLM to build a knowledge graph with embeddings and relationships. Agents query it for durable, cross-session context that captures how concepts connect. Self-host the Python SDK for free, or use the managed cloud tiers.
AI insight: Pairs vector search with an LLM-built knowledge graph so recall can follow relationships, not just nearest-neighbor similarity.
Plastic Labs
Continual learning memory for stateful agents. Better context, fewer tokens.
A memory and user-personalization layer for AI agents that keeps reasoning about each user across sessions, so apps get richer context without stuffing whole histories into the prompt. It models peers and sessions, runs background inference to derive durable facts, and answers natural-language questions about a user at query time. Available as a managed API or self-hosted FastAPI server, with Python and TypeScript SDKs.
AI insight: AGPL-3.0 core that runs background inference between sessions, storing derived facts about each user rather than raw transcripts.
Memobase
User profile-based long-term memory for LLM applications.
Memobase gives AI apps persistent, per-user memory by distilling conversations into a structured user profile and a chronological event timeline, rather than storing raw transcript embeddings. It exposes Python, Node, and Go SDKs plus a REST API and an MCP server, and is open source (Apache-2.0) with a managed cloud option. Built on FastAPI, Postgres, and Redis for low-latency retrieval.
AI insight: Models memory as a structured user profile plus an event timeline, not embedded transcript chunks like most memory layers.
Supermemory
Memory API that gives any AI agent long-term recall.
Supermemory is a memory and context engine for AI apps. It ingests documents, chat histories, and connector data (Drive, Gmail, Notion), turns them into a searchable store, and serves relevant context back to agents over a single API. It works with any model and ships an MCP server alongside official SDKs.
AI insight: MIT-licensed memory engine you can self-host or call as a managed API — one recall endpoint that works across any model.
Zep
Temporal knowledge-graph memory for AI agents.
Memory layer that gives agents long-term context by building a temporal knowledge graph from chat history and business data, tracking how facts evolve over time. It's powered by Graphiti, Zep's Apache-2.0 open-source temporal graph engine, with Zep Cloud offering a managed, credit-based service on top. Used to keep agent context relevant as conversations and data grow.
AI insight: Built on its open-source Graphiti engine, it stores a temporal knowledge graph that versions how facts change over time, not flat snapshots.
Letta
Stateful agents with structured memory. Successor to MemGPT.
Open-source framework for building stateful agents — memory blocks, context-window management, tool-use primitives baked in. Useful as a reference architecture for long-running agents.
AI insight: The productized successor to the MemGPT paper — agents edit their own memory blocks to manage a finite context window.
Mem0
Long-term memory layer for AI agents. Self-hostable.
Persistent memory store + retrieval pipeline for agent applications. Handles per-user/per-session/per-agent scope cleanly; pairs with OpenAI, Anthropic, and local models.
AI insight: Stores distilled facts rather than whole transcripts, so an agent's memory stays small and relevant as conversations grow.