1. Project Overview
| Dimension | Information |
|---|---|
| Project | topoteretes/cognee |
| The Open-Source AI Memory Platform for Agents | |
| Key Value | Provides cross-session long-term memory, self-hosted knowledge graph, vector graph hybrid retrieval, and context management for AI Agent |
| Core Operations | 'memorber', 'recall', 'improve', 'forget' |
| Deployment form | Python SDK, CLI, local UI, Docker/API Server, MCP Server, Cognee Cloud, Rust/TypeScript client |
| Storage Architecture | Relational store Vector store Graph store |
| Default local stack | Lightweight local components such as SQLite, LanceDB, and Kuzu |
| PostgreSQL/PGVector, Neo4j, Neptune, Redis, S3, Modal, Railway, Fly.io, Render, etc | |
| Model support | OpenAI default path; extensible Gemini, Anthropic, Ollama, Groq, Mistral, HuggingFace, llama.cpp, Azure, etc. |
| License | Apache-2.0 |
| Important Papers | Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning |
| Information Review Date | 2026-06-30 |
Note: GitHub anonymous API has triggered rate limit during this round of research, so precise stars/forks/issues has not been written into this note. It is recommended to open GitHub page to review the real-time heat and release status before formal external reference. However, judging from README, official documents, and Docker/CLI/Cloud/SDK instructions, cognee is no longer a demo-only gadget, but is trying to do a complete Agent memory infrastructure.
2. Schematic diagram of the project
2.1 Logo and Demo
2.2 Remember / Recall Working Schematic
2.3 Official Architecture Diagram
2.4 Demo Video Cover
3. What can it mainly do
3.1 provide "long-term memory" to Agent"
Many enterprises will encounter a typical problem when doing Agent: Agent can perform well in one session, but after changing a session, a task and a user, the previous context, preference, experience and business knowledge will be lost. Ordinary RAG can retrieve documents, but it does not necessarily express "who is related to whom", "what an event caused", "what this user preferred in the past" and "which solutions have been successful".
The core value of cognee is to turn these scattered data into long-term memory layers that agents can call:
-Document knowledge can be entered into permanent atlas memory.
-Session preferences can enter the session memory first.
-Session experience can be subsequently bridged to the permanent map via 'improve.
-Multiple Agents can share or call specific data sets under isolation permissions.
Pre-sales speech can be said like this:
ordinary RAG solves the problem of "finding document fragments". cognee pays more attention to "whether Agent can remember, understand relationships, continuously learn and reuse experience in subsequent tasks for a long time".
3.2 Four user-level operations: remember / recall / improve / forget
The user workflow of cognee v1.0 is abstracted into four operations:
| Action | Role | Pre-Sales Explanation |
|---|---|---|
| 'remember' | Save new memory to enter permanent map or session memory | Save documents, sessions, preferences, business knowledge |
| 'recall' | Query stored memory, support session-aware and graph-backed retrieval | The agent calls long-term memory when answering a response. |
| 'IMPOVE' | Enriches existing memory to bridge session history to a permanent map | allows the system to continuously evolve from historical interactions and feedback |
| 'forget' | Delete memories by item, dataset, and user range | Meet data cleaning, permission, and forgetting requirements |
This abstraction is very useful for pre-sales, because customers don't need to understand the internal graph/vector details at the beginning, just the four steps of "remember, recall, improve, and forget.
3.3 Vector Retrieval Knowledge Graph Relational Reasoning
The official architecture document clearly states that cognee combines three types of storage:
| Storage | What is responsible | Why is it important |
|---|---|---|
| Relational store | Document, chunk, source, provenance, and metadata | Resolves the traceability problem of "where data comes from and how to associate with the source document" |
| Vector store | embeddings and semantic similarity | Solve the retrieval problem of "different wording but similar semantics" |
| Graph store | Entities and Relationships | Solve the problem of "how to connect objects and how to reason along relationships" |
Ordinary vector RAG is better at "finding similar text". The selling point of cognee is to retain "semantic similarity" and "structural relationship" at the same time: for example, in a customer support scenario, not only a FAQ is found, but also users, work orders, products, historical solutions, invoices, and failed actions can be linked together.
3.4 support permanent memory and conversational memory
The example of the cognee README shows two types of 'memorber ':
await cognee.remember("Cognee turns documents into AI memory.")
await cognee.remember("User prefers detailed explanations.", session_id="chat_1")
The first is permanent knowledge map memory, and the second is session memory. For pre-sales, this is an important distinction:
-Permanent memory: company knowledge base, product documentation, historical cases, expert experience.
-Session memory: a user preference, a task context, and the current session state of an agent.
If customers are concerned about privacy and permissions, they need to further design the isolation method of dataset, user, and agent.
3.5 supports multiple retrieval modes
The CLI documentation mentions that 'recall' supports multiple query types:
| Query Type | What is suitable for |
|---|---|
| 'GRAPH_COMPLETION' | Generates more complete answers based on graphs and context, suitable for questions and answers by default. |
| 'RAG_COMPLETION ' | Regular RAG answer based on chunk |
| 'CHUNKS' | Returns the original text fragment, suitable for review and reference |
| 'SummarIES' | Back to summary, suitable for quick review |
| 'CYPHER' | Direct graph query, suitable for technicians who understand graph databases |
Pre-sales advice: Demonstrate the difference between 'GRAPH_COMPLETION and RAG_COMPLETION to the business side; Show 'CHUNKS', 'CYPHER' and provenance to the technical side.
3.6 can run the entire memory layer on the Postgres
There is a very important selling point of production deployment in README: cognee 1.0 can put the entire memory layer into a Postgres instance:
| Memory layer | traditional stack | cognee on Postgres |
|---|---|---|
| Relationships | Neo4j or other graph databases | Postgres graph backend of cognee |
| Embeddings | Independent vector database | pgvector |
| Sessions | Redis | SQL session-cache backend |
| Metadata | Relational database | Same Postgres |
This is very attractive to enterprise customers, because many customers do not want to maintain the Neo4j vector library Redis metadata DB multiple sets of infrastructure in the first place. Starting with Postgres and then replacing Neo4j, Neptune, Redis, Qdrant, Weaviate and other components according to scale is an easier route to land.
3.7 have CLI, local UI, MCP, Cloud and multi-language SDK
cognee is not just a Python package:
-Python SDK: suitable for embedding existing Agent/RAG applications.
-CLI:'cognee-cli remember/recall/improve/forget', suitable for PoC and scripting.
-Local UI:'cognee-cli -ui', which launches the backend and React app.
-Docker API Server: suitable for service.
-MCP Server: suitable for agent tools such as Claude Desktop, Codex, and Cursor.
-Cognee Cloud: Hosting experience.
-Rust client:'cognee-rs '.
-TypeScript client:'@ cognee/cognee-ts'.
-Claude Code plugin / OpenClaw plugin: Precipitates the usage traces and answers of the Agent into memory.
This ecological direction indicates that it wants to be an "Agent memory infrastructure", not just a Python notebook demo.
4. Applicable Scenario
4.1 Enterprise Knowledge Assistant: Upgrade from "Document Question and Answer" to "Corporate Brain"
Suitable for customers:
-Corporate Knowledge Management Team
-Pre-sales/consulting team
-R & D Support Team
-Customer Service Knowledge Base Team
Pain point:
-There are many documents, ordinary RAG can only recall fragments.
-There is a relationship between customer case, product capability, delivery experience, FAQ, work order, but it is not structured.
-After the employee leaves or the project is over, it is difficult to precipitate experience.
Cognee's cut-in:
-Use 'memorber' to access documents, FAQs, cases and meeting minutes.
-Use maps to express the relationship between customers, products, problems and solutions.
-Support business Q & A with 'recall.
-Use 'improve' to precipitate new session experience and feedback.
Pre-sales value:
Upgrade the enterprise knowledge base from "folder search box" to "agent memory layer for sustainable learning".
4.2 Customer Service/After-sales Agent: Avoid Repeated Mistakes
README gives an example of Customer Support Agent: users ask about invoice exceptions and unresolved problems. cognee can track historical interactions, failed actions, resolved cases, product history, and retrieve similar resolved cases.
Customer Value:
-New customer service can reuse historical solutions.
-Agent can see what has happened to this customer in the past.
-For repeated problems, the system can recommend a more confident processing path.
-Processing results can continue to be written back as memory.
PoC can be designed like this:
-Enter 50 historical work orders, 10 FAQs and 5 product change instructions.
-Select 10 real customer service questions.
-Compare whether the answers of ordinary RAG and cognee are more relevant to historical cases.
4.3 SQL Copilot/Expert Experience Reuse
The second example of README is Expert Knowledge Distillation: it precipitates expert SQL, workflow mode, schema structure and successful implementation to help junior analyst reuse expert experience.
This kind of scenario is perfect for pre-sales:
-Customers have senior analysts, but newcomers are slow to get started.
-SQL/BI/data caliber often makes repeated errors.
-Knowledge is not only in documents, but also in historical queries and expert operations.
COGNEE can be used as an "expert experience memory layer", allowing the Agent to find past similar tasks and successful queries based on the current schema, and then adapt to the current problem.
4.4 Developer Agent / Claude Code Memory
The cognee has a Claude Code plugin. The plug-in is captured in the Claude Code lifecycle:
-session start
-user prompt
-tool traces
-assistant response
-Memory protection before context compact
-sync to permanent atlas when end is session
Suitable for customers:
-Internal AI Coding Platform
-R & D knowledge base
-Code Helper memory enhancement
-Multi-project development experience precipitation
Presentation:
Many AI Coding tools have to reinterpret the project context every time. cognee can precipitate project constraints, historical decisions, tool use traces, and answers into a long-term memory that can be recalled.
4.5 Multi-Agent Collaboration Platform
CLI documents include 'agents create/list/register/connections' and other commands. Each agent can have its own agent user and API key, and authorize by dataset.
This is suitable for doing:
-Data isolation for multi-agent platforms.
-Different Agents share corporate knowledge but retain their own conversational memories.
-Agent connection status and memory sources management.
-Multi-tenant knowledge assistant.
5. Not suitable for the scene
| Scenario | Reason |
|---|---|
| Only simple FAQ search is required | Ordinary vector RAG or search engine is more lightweight, and the map/memory layer of cognee may be too heavy |
| Customers do not have long-term memory requirements for Agent. | If it is only a one-time document question and answer, the advantages of cognee will not be brought into full play. |
| Unwilling to maintain the model, database, and permission configuration | Cognee can run locally and lightly, but the production level still needs to configure LLM, embedding, storage, authentication, and security items |
| High demand for verifiability of answers but unwilling to do evaluation | Atlas LLM can still make mistakes and must establish a sample set and manual review |
| There is a mature knowledge graph platform and only the query UI is missing | cognee is more Agent memory/RAG infrastructure, not a substitute for the traditional enterprise-level graph platform |
| Requires strict multi-tenant SaaS but no security reinforcement | By default, access control is provided, but JWT, API key, file access, Cypher permissions, etc. still need to be checked for production deployment |
6. Core Competence List
| Capabilities | Descriptions | Pre-Sales Value |
|---|---|---|
| AI Agent's long-term memory | Preserve knowledge, preferences, interactions, and experiences across session | Solve Agent's "Every time I see you for the first time" problem |
| 'remember' | Ingest data in one step and build a search-ready memory | Lower the access threshold |
| 'recall' | Retrieves memory based on session or map | Supports Q & A and context injection |
| 'improve' | Rich memory that bridges session to a permanent map | Let the system evolve with use |
| 'forget' | Deletes memory in the item/dataset/user range | Meets data cleansing and compliance requirements |
| three-storage architecture | relational vector graph | takes into account provenance, semantic similarity, and relational reasoning |
| Postgres single stack | Graph, vector, session and metadata can be put into Postgres | Reduce the complexity of enterprise deployment |
| Multi-Provider | OpenAI, Gemini, Anthropic, Ollama, Groq, Mistral, etc. | Supports cloud and on-premises model policies |
| Multiple storage backends | SQLite, LanceDB, Kuzu, Postgres, Neo4j, Neptune, Qdrant, etc. | Smoothly move from local PoC to production |
| CLI | 'cognee-cli remember/recall/improve/forget' | Easy for quick demonstration and scripting |
| MCP Server | Allows MCP clients to call cognee memory | Suitable for Claude, Codex, Cursor, etc. |
| Python/Rust/TS client | Multilingual SDK | Easy access to different technology stacks |
| Security controls | Auth, multi-user isolation, API key hash, Cypher permissions, and file path switches | Suitable for enterprise security discussions |
7. Architecture/Deployment/Integration
7.1 high-level architecture
文档/Chunk/Provenance"] D --> F["Vector Store
Embeddings/语义检索"] D --> G["Graph Store
实体/关系"] E --> H["Recall"] F --> H G --> H H --> I["Agent / App / CLI / MCP"]
7.2 Local PoC Deployment
Minimum installation:
uv pip install cognee
Configure the LLM:
export LLM_API_KEY="YOUR_OPENAI_API_KEY"
Python example:
import cognee
import asyncio
async def main():
await cognee.remember("Cognee turns documents into AI memory.")
results = await cognee.recall("What does Cognee do?")
for result in results:
print(result)
asyncio.run(main())
CLI example:
cognee-cli remember docs/company-handbook.pdf --dataset-name handbook
cognee-cli recall "Who owns the rollout plan?" --datasets handbook
cognee-cli improve --dataset-name handbook
cognee-cli forget --dataset handbook
7.3 Docker / API / UI
Docker Compose:
cp .env.template .env
docker compose up
docker compose --profile ui up
docker compose --profile mcp up
docker compose --profile postgres up
docker compose --profile neo4j up
Pre-build image:
echo 'LLM_API_KEY="YOUR_OPENAI_API_KEY"' > .env
docker run --env-file ./.env -p 8000:8000 --rm -it cognee/cognee:main
docker run -e TRANSPORT_MODE=http --env-file ./.env -p 8000:8000 --rm -it cognee/cognee-mcp:main
Local UI:
cognee-cli -ui
The UI launches the backend 'http:// localhost:8000 'and the React app 'http:// localhost:3000' and attempts to start the MCP server.
7.4 Postgres production route
Installation:
pip install "cognee[postgres]"
Environment variables:
DB_PROVIDER=postgres
VECTOR_DB_PROVIDER=pgvector
GRAPH_DATABASE_PROVIDER=postgres
CACHE_BACKEND=postgres
DB_HOST=localhost
DB_PORT=5432
DB_USERNAME=cognee
DB_PASSWORD=cognee
DB_NAME=cognee_db
Pre-sales advice: For most enterprise customers, the Postgres route is easier to accept than the introduction of Neo4j Redis vector library at the beginning. After the business scale, graph query complexity and vector scale come up, consider replacing special components.
Key Points of 7.5 Security Configuration
The official security document deserves a pre-sales focus reminder:
| Configuration | Default/Description | Production Recommendations |
|---|---|---|
| 'ENABLE_BACKEND_ACCESS_CONTROL' | Default true, multi-tenant isolation authentication | Production suggestions remain enabled |
| 'FASTAPI_USERS_JWT_SECRET' | Default value is not secure | Must be replaced with a long random key |
| 'HASH_API_KEY' | Default false,API key plaintext storage | Production recommendation set to true |
| 'ACCEPT_LOCAL_FILE_PATH' | Default true | When it is used as a backend service, false is recommended to prevent users from reading any local files. |
| 'ALLOW_CYPHER_QUERY' | Default true | It is recommended to turn off on demand in multi-user environment to avoid the risk of original graph query |
| 'NEO4J_ENCRYPTION_KEY' | Default test_key | Must be replaced when using Neo4j Aura |
This part can be used as a starting point for communication with the customer security team: cognee has a security switch, but it cannot run naked online by default.
8. What can I say before sales
Business-oriented
The biggest problem with many AI assistants nowadays is not that they won't answer, but that they won't remember for a long time. What has been asked, solved and fed back today will be explained again next time. Cognee can deposit enterprise documents, historical work orders, expert experience and conversation preferences into Agent's long-term memory, so that AI assistants not only "check data", but gradually form "company brains".
Business Value:
-Reduce repetitive interpretation background.
-Let newcomers reuse expert experience.
-Let customer service/after-sales Agent refer to historical successful cases.
-Upgrade the enterprise knowledge base from document retrieval to relational memory.
-Support knowledge forgetting, data isolation and subsequent governance.
Technology-oriented
cognee is an agent memory layer that combines relational, vector, and graph storage. It retains chunks and sources, generates embeddings for semantic retrieval, and extracts entity relationships into the graph. The upper layer uses 'remember/recall/improve/forget' to simplify the call, and the lower layer can switch from the local stack of SQLite/LanceDB/Kuzu to the production stack of Postgres/PGVector/Neo4j.
Technical value:
-More structural relationships than unidirectional libraries.
-Easier access to LLM/Agents than traditional knowledge maps.
-Faster PoC than self-developed memory layer.
-CLI, SDK, MCP, Docker are available, and there are many integration paths.
-Apache-2.0 is relatively friendly for business integration.
For management
cognee is suitable for long-term memory PoC as an enterprise AI assistant first, and does not need to build a huge knowledge graph platform at the beginning. We can choose a scenario with clear value, such as customer service history cases, pre-sales database, research and development knowledge base, and use 1-2 weeks to verify whether it can improve the quality of the answer and reduce duplication of labor.
9. Frequently Asked Customer Questions
| What is the difference between cogne and ordinary RAG? | Ordinary RAG mainly searches for document fragments by vectors; cogne simultaneously maintains document sources, vector semantics, and entity relationships, and supports long-term memory, session memory, memory improvement, and forgetting. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Is it a knowledge graph database? | Not a substitute for a traditional graph database. It is more like the Agent memory layer, which uses graph databases/graph backends to express relationships, as well as vector and relational databases. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Can it be privatized? | Can. The local default lightweight stack can run PoC, and production can be deployed in Docker, Postgres, Neo4j, API Server, MCP, etc. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Can I use only Postgres? | README clearly shows that cognee 1.0 can put relationships, vectors, session, and metadata into the Postgres/pgvector system, which is suitable for reducing deployment complexity. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Which models are supported? | The default OpenAI path includes Gemini, Anthropic, Ollama, Groq, Mistral, HuggingFace, llama.cpp, and Azure extras/config routes. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| How about security? | Configurations such as authentication, multi-tenant isolation, API key hash, local file access switch, and Cypher query switch are available. However, the production deployment must be reinforced one by one. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Is it suitable for Chinese? | The official does not take Chinese ability as the core selling point. Chinese effect mainly depends on the LLM, embedding, chunking and entity extraction effect, need to use Chinese sample PoC. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Can I connect Claude Code/Codex? | README clearly has Claude Code plugin and MCP Server. Codex can be integrated through MCP/CLI/API, and needs to be tested according to the actual client capability. |
10. PoC Recommendations
PoC 1: Pre-Sales Data Long-Term Memory Assistant
Input material:
-20 product proposals
-20 customer cases
-10 copies of competition information
-10 historical pre-sales questions and answers
Validation issues:
-What are the common pain points of customers in a certain industry?
-In which cases has a product capability been used?
-What is the difference between a competition and our scheme?
-What concerns were mentioned in a customer's historical communication?
Success Metrics:
-The answer can quote the correct information.
-Ability to link cases, product capabilities and customer issues.
-Relational questions answer more completely than ordinary RAG.
-Support to isolate different customer data by dataset.
PoC 2: Customer Service History Case Memory
Input material:
-100 historical work orders
-FAQ documentation
-Product change records
-Processing results and customer feedback
Validation issues:
-Are there similar historical cases of the current issue?
-Which treatments have failed?
-Which version change could cause this problem?
-What processing path should be recommended for the new job?
Success Metrics:
-Recall accuracy for similar cases.
-The answer can explain the reason for the association.
-Can reduce the time of repeated investigation by customer service.
PoC 3:AI Coding Project Memory
Input material:
-Project Readme
-Architecture Description
-Code specification
-Historical PR/issue summary
-Claude Code/Codex session record
Validation issues:
-What are the architectural constraints of the project?
-Why was a module designed like this before?
-What code styles should new features follow?
-How were similar bugs fixed in the past?
Success Metrics:
-Proactive recall of project context in new sessions.
-Reduce duplication in explaining project background.
-Be able to locate relevant historical decisions.
11. Risks and Considerations
| Risk | Description | Response Recommendations |
|---|---|---|
| LLM Extraction Quality | Map and Memory Quality Depends on LLM, embedding, chunking and pipeline | Evaluate with Real Samples, Not Just Demo |
| Graph error propagation | Error entities or relationships may affect subsequent recall | Add manual sampling, data rollback, and 'forget' policies |
| Security default items need to be strengthened | JWT secret, API key plaintext, local file path, Cypher query, etc. must be checked | Do security configuration checklist before going online |
| Multi-storage complexity | Deployment and troubleshooting costs brought by relational/vector/graph multi-layer | PoC uses the default stack, and production is preferred Postgres single stack |
| Cost | LLM/embedding calls, graph building, and improve may all incur costs | Estimate the number of chunks, calls, and concurrency first |
| Chinese/Industry Ontology | Chinese Entity Extraction and Domain Relationships may need customization | Prepare Industry Samples and Ontology/ontology |
| Does not equal to complete knowledge middle-end | Lack of complete product capabilities such as enterprise-level UI, approval, governance, authority audit, etc. | As a memory/RAG component, rather than an overall platform replacement |
12. My Pre-Sales Judgment
Cognee is one of the most important projects in this batch, because it has hit a very real pain point in the Agent industry: Agent can not only call tools, but also remember, continuously learn and understand enterprise knowledge according to relationships for a long time.
It is suitable for pre-sales positioning is not "another RAG framework",:
Agent long-term memory layer enterprise knowledge mapping memory vector/graph hybrid retrieval infrastructure.
The most suitable for three types of customers:
- Customers who have done RAG, but found that the response lacks context and relational reasoning.
- Customers who are working on an enterprise Agent platform and need memory layer.
- Customers with "experience precipitation" needs such as customer service, pre-sales, research and development, data analysis, etc.
It is not recommended to commit to production to replace the customer's existing knowledge platform. A more stable approach is to use 1-2 high-value scenarios for PoC, especially "historical cases, current problem relationship reasoning. If the cognee is significantly better than the ordinary vector RAG on these issues, it is easy to bring the customer to the "long-term memory/company brain" scenario narrative.