cognee

← Back to Project List

cognee is an open source long-term memory platform for AI agents, which processes data such as documents, sessions, user preferences, and business knowledge into a knowledge memory layer that can be semantically retrieved and reasoned about. It is neither a common vector library nor a simple knowledge graph tool. Instead, it combines the relational store, vector store, and graph store to provide a context for the sustainable evolution of the Agent around the four operations of 'remember / recall / improve/forget. Before sales, it is suitable for cutting into the customer scenario where "enterprise AI assistants have no long-term memory, RAG only finds fragments, and Agent cannot accumulate experience through multiple rounds of collaboration.

1. Project Overview

Dimension	Information
Project	topoteretes/cognee
The Open-Source AI Memory Platform for Agents
Key Value	Provides cross-session long-term memory, self-hosted knowledge graph, vector graph hybrid retrieval, and context management for AI Agent
Core Operations	'memorber', 'recall', 'improve', 'forget'
Deployment form	Python SDK, CLI, local UI, Docker/API Server, MCP Server, Cognee Cloud, Rust/TypeScript client
Storage Architecture	Relational store Vector store Graph store
Default local stack	Lightweight local components such as SQLite, LanceDB, and Kuzu
PostgreSQL/PGVector, Neo4j, Neptune, Redis, S3, Modal, Railway, Fly.io, Render, etc
Model support	OpenAI default path; extensible Gemini, Anthropic, Ollama, Groq, Mistral, HuggingFace, llama.cpp, Azure, etc.
License	Apache-2.0
Important Papers	Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning
Information Review Date	2026-06-30

Note: GitHub anonymous API has triggered rate limit during this round of research, so precise stars/forks/issues has not been written into this note. It is recommended to open GitHub page to review the real-time heat and release status before formal external reference. However, judging from README, official documents, and Docker/CLI/Cloud/SDK instructions, cognee is no longer a demo-only gadget, but is trying to do a complete Agent memory infrastructure.

2. Schematic diagram of the project

3. What can it mainly do

3.1 provide "long-term memory" to Agent"

Many enterprises will encounter a typical problem when doing Agent: Agent can perform well in one session, but after changing a session, a task and a user, the previous context, preference, experience and business knowledge will be lost. Ordinary RAG can retrieve documents, but it does not necessarily express "who is related to whom", "what an event caused", "what this user preferred in the past" and "which solutions have been successful".

The core value of cognee is to turn these scattered data into long-term memory layers that agents can call:

-Document knowledge can be entered into permanent atlas memory.

-Session preferences can enter the session memory first.

-Session experience can be subsequently bridged to the permanent map via 'improve.

-Multiple Agents can share or call specific data sets under isolation permissions.

Pre-sales speech can be said like this:

ordinary RAG solves the problem of "finding document fragments". cognee pays more attention to "whether Agent can remember, understand relationships, continuously learn and reuse experience in subsequent tasks for a long time".

3.2 Four user-level operations: remember / recall / improve / forget

The user workflow of cognee v1.0 is abstracted into four operations:

Action	Role	Pre-Sales Explanation
'remember'	Save new memory to enter permanent map or session memory	Save documents, sessions, preferences, business knowledge
'recall'	Query stored memory, support session-aware and graph-backed retrieval	The agent calls long-term memory when answering a response.
'IMPOVE'	Enriches existing memory to bridge session history to a permanent map	allows the system to continuously evolve from historical interactions and feedback
'forget'	Delete memories by item, dataset, and user range	Meet data cleaning, permission, and forgetting requirements

This abstraction is very useful for pre-sales, because customers don't need to understand the internal graph/vector details at the beginning, just the four steps of "remember, recall, improve, and forget.

3.3 Vector Retrieval Knowledge Graph Relational Reasoning

The official architecture document clearly states that cognee combines three types of storage:

Storage	What is responsible	Why is it important
Relational store	Document, chunk, source, provenance, and metadata	Resolves the traceability problem of "where data comes from and how to associate with the source document"
Vector store	embeddings and semantic similarity	Solve the retrieval problem of "different wording but similar semantics"
Graph store	Entities and Relationships	Solve the problem of "how to connect objects and how to reason along relationships"

Ordinary vector RAG is better at "finding similar text". The selling point of cognee is to retain "semantic similarity" and "structural relationship" at the same time: for example, in a customer support scenario, not only a FAQ is found, but also users, work orders, products, historical solutions, invoices, and failed actions can be linked together.

3.4 support permanent memory and conversational memory

The example of the cognee README shows two types of 'memorber ':

await cognee.remember("Cognee turns documents into AI memory.")
await cognee.remember("User prefers detailed explanations.", session_id="chat_1")

The first is permanent knowledge map memory, and the second is session memory. For pre-sales, this is an important distinction:

-Permanent memory: company knowledge base, product documentation, historical cases, expert experience.

-Session memory: a user preference, a task context, and the current session state of an agent.

If customers are concerned about privacy and permissions, they need to further design the isolation method of dataset, user, and agent.

3.5 supports multiple retrieval modes

The CLI documentation mentions that 'recall' supports multiple query types:

Query Type	What is suitable for
'GRAPH_COMPLETION'	Generates more complete answers based on graphs and context, suitable for questions and answers by default.
'RAG_COMPLETION '	Regular RAG answer based on chunk
'CHUNKS'	Returns the original text fragment, suitable for review and reference
'SummarIES'	Back to summary, suitable for quick review
'CYPHER'	Direct graph query, suitable for technicians who understand graph databases

Pre-sales advice: Demonstrate the difference between 'GRAPH_COMPLETION and RAG_COMPLETION to the business side; Show 'CHUNKS', 'CYPHER' and provenance to the technical side.

3.6 can run the entire memory layer on the Postgres

There is a very important selling point of production deployment in README: cognee 1.0 can put the entire memory layer into a Postgres instance:

Memory layer	traditional stack	cognee on Postgres
Relationships	Neo4j or other graph databases	Postgres graph backend of cognee
Embeddings	Independent vector database	pgvector
Sessions	Redis	SQL session-cache backend
Metadata	Relational database	Same Postgres

This is very attractive to enterprise customers, because many customers do not want to maintain the Neo4j vector library Redis metadata DB multiple sets of infrastructure in the first place. Starting with Postgres and then replacing Neo4j, Neptune, Redis, Qdrant, Weaviate and other components according to scale is an easier route to land.

3.7 have CLI, local UI, MCP, Cloud and multi-language SDK

cognee is not just a Python package:

-Python SDK: suitable for embedding existing Agent/RAG applications.

-CLI:'cognee-cli remember/recall/improve/forget', suitable for PoC and scripting.

-Local UI:'cognee-cli -ui', which launches the backend and React app.

-Docker API Server: suitable for service.

-MCP Server: suitable for agent tools such as Claude Desktop, Codex, and Cursor.

-Cognee Cloud: Hosting experience.

-Rust client:'cognee-rs '.

-TypeScript client:'@ cognee/cognee-ts'.

-Claude Code plugin / OpenClaw plugin: Precipitates the usage traces and answers of the Agent into memory.

This ecological direction indicates that it wants to be an "Agent memory infrastructure", not just a Python notebook demo.

4. Applicable Scenario

4.1 Enterprise Knowledge Assistant: Upgrade from "Document Question and Answer" to "Corporate Brain"

Suitable for customers:

-Corporate Knowledge Management Team

-Pre-sales/consulting team

-R & D Support Team

-Customer Service Knowledge Base Team

Pain point:

-There are many documents, ordinary RAG can only recall fragments.

-There is a relationship between customer case, product capability, delivery experience, FAQ, work order, but it is not structured.

-After the employee leaves or the project is over, it is difficult to precipitate experience.

Cognee's cut-in:

-Use 'memorber' to access documents, FAQs, cases and meeting minutes.

-Use maps to express the relationship between customers, products, problems and solutions.

-Support business Q & A with 'recall.

-Use 'improve' to precipitate new session experience and feedback.

Pre-sales value:

Upgrade the enterprise knowledge base from "folder search box" to "agent memory layer for sustainable learning".

4.2 Customer Service/After-sales Agent: Avoid Repeated Mistakes

README gives an example of Customer Support Agent: users ask about invoice exceptions and unresolved problems. cognee can track historical interactions, failed actions, resolved cases, product history, and retrieve similar resolved cases.

Customer Value:

-New customer service can reuse historical solutions.

-Agent can see what has happened to this customer in the past.

-For repeated problems, the system can recommend a more confident processing path.

-Processing results can continue to be written back as memory.

PoC can be designed like this:

-Enter 50 historical work orders, 10 FAQs and 5 product change instructions.

-Select 10 real customer service questions.

-Compare whether the answers of ordinary RAG and cognee are more relevant to historical cases.

4.3 SQL Copilot/Expert Experience Reuse

The second example of README is Expert Knowledge Distillation: it precipitates expert SQL, workflow mode, schema structure and successful implementation to help junior analyst reuse expert experience.

This kind of scenario is perfect for pre-sales:

-Customers have senior analysts, but newcomers are slow to get started.

-SQL/BI/data caliber often makes repeated errors.

-Knowledge is not only in documents, but also in historical queries and expert operations.

COGNEE can be used as an "expert experience memory layer", allowing the Agent to find past similar tasks and successful queries based on the current schema, and then adapt to the current problem.

4.4 Developer Agent / Claude Code Memory

The cognee has a Claude Code plugin. The plug-in is captured in the Claude Code lifecycle:

-session start

-user prompt

-tool traces

-assistant response

-Memory protection before context compact

-sync to permanent atlas when end is session

Suitable for customers:

-Internal AI Coding Platform

-R & D knowledge base

-Code Helper memory enhancement

-Multi-project development experience precipitation

Presentation:

Many AI Coding tools have to reinterpret the project context every time. cognee can precipitate project constraints, historical decisions, tool use traces, and answers into a long-term memory that can be recalled.

4.5 Multi-Agent Collaboration Platform

CLI documents include 'agents create/list/register/connections' and other commands. Each agent can have its own agent user and API key, and authorize by dataset.

This is suitable for doing:

-Data isolation for multi-agent platforms.

-Different Agents share corporate knowledge but retain their own conversational memories.

-Agent connection status and memory sources management.

-Multi-tenant knowledge assistant.

5. Not suitable for the scene

Scenario	Reason
Only simple FAQ search is required	Ordinary vector RAG or search engine is more lightweight, and the map/memory layer of cognee may be too heavy
Customers do not have long-term memory requirements for Agent.	If it is only a one-time document question and answer, the advantages of cognee will not be brought into full play.
Unwilling to maintain the model, database, and permission configuration	Cognee can run locally and lightly, but the production level still needs to configure LLM, embedding, storage, authentication, and security items
High demand for verifiability of answers but unwilling to do evaluation	Atlas LLM can still make mistakes and must establish a sample set and manual review
There is a mature knowledge graph platform and only the query UI is missing	cognee is more Agent memory/RAG infrastructure, not a substitute for the traditional enterprise-level graph platform
Requires strict multi-tenant SaaS but no security reinforcement	By default, access control is provided, but JWT, API key, file access, Cypher permissions, etc. still need to be checked for production deployment

6. Core Competence List

Capabilities	Descriptions	Pre-Sales Value
AI Agent's long-term memory	Preserve knowledge, preferences, interactions, and experiences across session	Solve Agent's "Every time I see you for the first time" problem
'remember'	Ingest data in one step and build a search-ready memory	Lower the access threshold
'recall'	Retrieves memory based on session or map	Supports Q & A and context injection
'improve'	Rich memory that bridges session to a permanent map	Let the system evolve with use
'forget'	Deletes memory in the item/dataset/user range	Meets data cleansing and compliance requirements
three-storage architecture	relational vector graph	takes into account provenance, semantic similarity, and relational reasoning
Postgres single stack	Graph, vector, session and metadata can be put into Postgres	Reduce the complexity of enterprise deployment
Multi-Provider	OpenAI, Gemini, Anthropic, Ollama, Groq, Mistral, etc.	Supports cloud and on-premises model policies
Multiple storage backends	SQLite, LanceDB, Kuzu, Postgres, Neo4j, Neptune, Qdrant, etc.	Smoothly move from local PoC to production
CLI	'cognee-cli remember/recall/improve/forget'	Easy for quick demonstration and scripting
MCP Server	Allows MCP clients to call cognee memory	Suitable for Claude, Codex, Cursor, etc.
Python/Rust/TS client	Multilingual SDK	Easy access to different technology stacks
Security controls	Auth, multi-user isolation, API key hash, Cypher permissions, and file path switches	Suitable for enterprise security discussions

7. Architecture/Deployment/Integration

7.1 high-level architecture

flowchart LR A["原始数据/文档/会话/URL/S3"] --> B["Loaders / Chunkers"] B --> C["Tasks / Pipelines"] C --> D["DataPoints"] D --> E["Relational Store
文档/Chunk/Provenance"] D --> F["Vector Store
Embeddings/语义检索"] D --> G["Graph Store
实体/关系"] E --> H["Recall"] F --> H G --> H H --> I["Agent / App / CLI / MCP"]

7.2 Local PoC Deployment

Minimum installation:

uv pip install cognee

Configure the LLM:

export LLM_API_KEY="YOUR_OPENAI_API_KEY"

Python example:

import cognee
import asyncio

async def main():
    await cognee.remember("Cognee turns documents into AI memory.")
    results = await cognee.recall("What does Cognee do?")
    for result in results:
        print(result)

asyncio.run(main())

CLI example:

cognee-cli remember docs/company-handbook.pdf --dataset-name handbook
cognee-cli recall "Who owns the rollout plan?" --datasets handbook
cognee-cli improve --dataset-name handbook
cognee-cli forget --dataset handbook

7.3 Docker / API / UI

Docker Compose:

cp .env.template .env
docker compose up
docker compose --profile ui up
docker compose --profile mcp up
docker compose --profile postgres up
docker compose --profile neo4j up

Pre-build image:

echo 'LLM_API_KEY="YOUR_OPENAI_API_KEY"' > .env
docker run --env-file ./.env -p 8000:8000 --rm -it cognee/cognee:main
docker run -e TRANSPORT_MODE=http --env-file ./.env -p 8000:8000 --rm -it cognee/cognee-mcp:main

Local UI:

cognee-cli -ui

The UI launches the backend 'http:// localhost:8000 'and the React app 'http:// localhost:3000' and attempts to start the MCP server.

7.4 Postgres production route

Installation:

pip install "cognee[postgres]"

Environment variables:

DB_PROVIDER=postgres
VECTOR_DB_PROVIDER=pgvector
GRAPH_DATABASE_PROVIDER=postgres
CACHE_BACKEND=postgres

DB_HOST=localhost
DB_PORT=5432
DB_USERNAME=cognee
DB_PASSWORD=cognee
DB_NAME=cognee_db

Pre-sales advice: For most enterprise customers, the Postgres route is easier to accept than the introduction of Neo4j Redis vector library at the beginning. After the business scale, graph query complexity and vector scale come up, consider replacing special components.

Key Points of 7.5 Security Configuration

The official security document deserves a pre-sales focus reminder:

Configuration	Default/Description	Production Recommendations
'ENABLE_BACKEND_ACCESS_CONTROL'	Default true, multi-tenant isolation authentication	Production suggestions remain enabled
'FASTAPI_USERS_JWT_SECRET'	Default value is not secure	Must be replaced with a long random key
'HASH_API_KEY'	Default false,API key plaintext storage	Production recommendation set to true
'ACCEPT_LOCAL_FILE_PATH'	Default true	When it is used as a backend service, false is recommended to prevent users from reading any local files.
'ALLOW_CYPHER_QUERY'	Default true	It is recommended to turn off on demand in multi-user environment to avoid the risk of original graph query
'NEO4J_ENCRYPTION_KEY'	Default test_key	Must be replaced when using Neo4j Aura

This part can be used as a starting point for communication with the customer security team: cognee has a security switch, but it cannot run naked online by default.

8. What can I say before sales

Business-oriented

The biggest problem with many AI assistants nowadays is not that they won't answer, but that they won't remember for a long time. What has been asked, solved and fed back today will be explained again next time. Cognee can deposit enterprise documents, historical work orders, expert experience and conversation preferences into Agent's long-term memory, so that AI assistants not only "check data", but gradually form "company brains".

Business Value:

-Reduce repetitive interpretation background.

-Let newcomers reuse expert experience.

-Let customer service/after-sales Agent refer to historical successful cases.

-Upgrade the enterprise knowledge base from document retrieval to relational memory.

-Support knowledge forgetting, data isolation and subsequent governance.

Technology-oriented

cognee is an agent memory layer that combines relational, vector, and graph storage. It retains chunks and sources, generates embeddings for semantic retrieval, and extracts entity relationships into the graph. The upper layer uses 'remember/recall/improve/forget' to simplify the call, and the lower layer can switch from the local stack of SQLite/LanceDB/Kuzu to the production stack of Postgres/PGVector/Neo4j.

Technical value:

-More structural relationships than unidirectional libraries.

-Easier access to LLM/Agents than traditional knowledge maps.

-Faster PoC than self-developed memory layer.

-CLI, SDK, MCP, Docker are available, and there are many integration paths.

-Apache-2.0 is relatively friendly for business integration.

For management

cognee is suitable for long-term memory PoC as an enterprise AI assistant first, and does not need to build a huge knowledge graph platform at the beginning. We can choose a scenario with clear value, such as customer service history cases, pre-sales database, research and development knowledge base, and use 1-2 weeks to verify whether it can improve the quality of the answer and reduce duplication of labor.

9. Frequently Asked Customer Questions


What is the difference between cogne and ordinary RAG?	Ordinary RAG mainly searches for document fragments by vectors; cogne simultaneously maintains document sources, vector semantics, and entity relationships, and supports long-term memory, session memory, memory improvement, and forgetting.
Is it a knowledge graph database?	Not a substitute for a traditional graph database. It is more like the Agent memory layer, which uses graph databases/graph backends to express relationships, as well as vector and relational databases.
Can it be privatized?	Can. The local default lightweight stack can run PoC, and production can be deployed in Docker, Postgres, Neo4j, API Server, MCP, etc.
Can I use only Postgres?	README clearly shows that cognee 1.0 can put relationships, vectors, session, and metadata into the Postgres/pgvector system, which is suitable for reducing deployment complexity.
Which models are supported?	The default OpenAI path includes Gemini, Anthropic, Ollama, Groq, Mistral, HuggingFace, llama.cpp, and Azure extras/config routes.
How about security?	Configurations such as authentication, multi-tenant isolation, API key hash, local file access switch, and Cypher query switch are available. However, the production deployment must be reinforced one by one.
Is it suitable for Chinese?	The official does not take Chinese ability as the core selling point. Chinese effect mainly depends on the LLM, embedding, chunking and entity extraction effect, need to use Chinese sample PoC.
Can I connect Claude Code/Codex?	README clearly has Claude Code plugin and MCP Server. Codex can be integrated through MCP/CLI/API, and needs to be tested according to the actual client capability.

10. PoC Recommendations

PoC 1: Pre-Sales Data Long-Term Memory Assistant

Input material:

-20 product proposals

-20 customer cases

-10 copies of competition information

-10 historical pre-sales questions and answers

Validation issues:

-What are the common pain points of customers in a certain industry?

-In which cases has a product capability been used?

-What is the difference between a competition and our scheme?

-What concerns were mentioned in a customer's historical communication?

Success Metrics:

-The answer can quote the correct information.

-Ability to link cases, product capabilities and customer issues.

-Relational questions answer more completely than ordinary RAG.

-Support to isolate different customer data by dataset.

PoC 2: Customer Service History Case Memory

Input material:

-100 historical work orders

-FAQ documentation

-Product change records

-Processing results and customer feedback

Validation issues:

-Are there similar historical cases of the current issue?

-Which treatments have failed?

-Which version change could cause this problem?

-What processing path should be recommended for the new job?

Success Metrics:

-Recall accuracy for similar cases.

-The answer can explain the reason for the association.

-Can reduce the time of repeated investigation by customer service.

PoC 3:AI Coding Project Memory

Input material:

-Project Readme

-Architecture Description

-Code specification

-Historical PR/issue summary

-Claude Code/Codex session record

Validation issues:

-What are the architectural constraints of the project?

-Why was a module designed like this before?

-What code styles should new features follow?

-How were similar bugs fixed in the past?

Success Metrics:

-Proactive recall of project context in new sessions.

-Reduce duplication in explaining project background.

-Be able to locate relevant historical decisions.

11. Risks and Considerations

Risk	Description	Response Recommendations
LLM Extraction Quality	Map and Memory Quality Depends on LLM, embedding, chunking and pipeline	Evaluate with Real Samples, Not Just Demo
Graph error propagation	Error entities or relationships may affect subsequent recall	Add manual sampling, data rollback, and 'forget' policies
Security default items need to be strengthened	JWT secret, API key plaintext, local file path, Cypher query, etc. must be checked	Do security configuration checklist before going online
Multi-storage complexity	Deployment and troubleshooting costs brought by relational/vector/graph multi-layer	PoC uses the default stack, and production is preferred Postgres single stack
Cost	LLM/embedding calls, graph building, and improve may all incur costs	Estimate the number of chunks, calls, and concurrency first
Chinese/Industry Ontology	Chinese Entity Extraction and Domain Relationships may need customization	Prepare Industry Samples and Ontology/ontology
Does not equal to complete knowledge middle-end	Lack of complete product capabilities such as enterprise-level UI, approval, governance, authority audit, etc.	As a memory/RAG component, rather than an overall platform replacement

12. My Pre-Sales Judgment

Cognee is one of the most important projects in this batch, because it has hit a very real pain point in the Agent industry: Agent can not only call tools, but also remember, continuously learn and understand enterprise knowledge according to relationships for a long time.

It is suitable for pre-sales positioning is not "another RAG framework",:

Agent long-term memory layer enterprise knowledge mapping memory vector/graph hybrid retrieval infrastructure.

The most suitable for three types of customers:

Customers who have done RAG, but found that the response lacks context and relational reasoning.
Customers who are working on an enterprise Agent platform and need memory layer.
Customers with "experience precipitation" needs such as customer service, pre-sales, research and development, data analysis, etc.

It is not recommended to commit to production to replace the customer's existing knowledge platform. A more stable approach is to use 1-2 high-value scenarios for PoC, especially "historical cases, current problem relationship reasoning. If the cognee is significantly better than the ordinary vector RAG on these issues, it is easy to bring the customer to the "long-term memory/company brain" scenario narrative.