1. Project/Product Overview
| Dimension | Information |
|---|---|
| Project name | RAGFlow |
| Developer | infiniflow (China Team) |
| Open Source License | Apache-2.0 |
| Main Languages | Go Python (TypeScript for Web Front End) |
| GitHub Stars | 84,059**(2026-07-02 Query, No.1 RAG Project in the World) |
| Forks | 9,767 |
| Commits | 7,081 |
| Open Issues | 2,608 |
| Created | 2023-12-12 (about 2.5 years old) |
| Last Updated | 2026-07-01 (Daily Active) |
| Latest Version | v0.26.2(2026-06-29), Total 58 Tags |
| official website | https://ragflow.io |
| Cloud Services | https://cloud.ragflow.io(Free / Starter $29/mo / Pro $129/mo / Enterprise) |
| Document | https://ragflow.io/docs/dev/ |
| Community | Discord, GitHub Issues / Discussions |
| Topics | rag, retrieval-augmented-generation, agentic-ai, ai-agents, context-engine, llm-apps, agentic-search, context-management, agentic-retrieval |
2. What does it mostly do?
The core positioning of the RAGFlow is " RAG Agent integrated context engine " -- not only does document retrieval, but also provides a set of full-link capabilities from document understanding to intelligent Agent.
Core Architecture (Six Modules)
用户/API 层 → Chat / Agent / 搜索界面 / REST API
↓
Agent 编排层 → Workflow(人工编排) + Agentic Workflow(LLM 自主规划)
↓
检索层 → 多路召回(向量 + 关键词 + 知识图谱) + 融合重排序
↓
索引层 → Elasticsearch / Infinity(全文 + 向量混合索引)
↓
知识提取层 → DeepDoc 引擎 / MinerU / Docling / OpenDataLoader
↓
数据源层 → 文件上传 / 数据源连接器(Confluence、S3、SharePoint、飞书等)
Main function module
| Module | Capability description |
|---|---|
| DeepDoc Engine | Deep Document Understanding: OCR Table Structure Recognition (TSR) Document Layout Recognition (DLR), Three-in-One Visual Model |
| Templated blocking | 9 blocking strategies: General, Manual, Q & A, Table, Paper, Book, Laws, Presentation, One |
| Multiple PDF Parsers | DeepDoc (default)/ Naive / MinerU / Docling/OpenDataLoader/3rd Party VLM |
| Multi-way hybrid search | Vector search BM25 Keyword search Tensor search Knowledge graph search, fusion reordering |
| Agent Workflow Unified Orchestration | Supports both manual Workflow and LLM Agentic Workflow on the visual canvas, including iteration, conditional branch, switch, and code executor |
| Full MCP Support | Import an MCP server, use an agent as an MCP client, and expose the RAGFlow itself as an MCP server. |
| GraphRAG | Dynamic knowledge graph construction at the dataset level, supporting entity extraction and community analysis |
| Long-Context RAG | Automatically generates a document directory (TOC) structure to mitigate context loss |
| RAPTOR | Cross-document hierarchical summary tree construction |
| Orchestration ingest pipeline | Visualizes the data ingest pipeline and supports custom data cleaning processes. |
| Multi-channel access | Flying books, Discord, Telegram, Line, Slack, Microsoft Teams, etc. |
| Data Source Connectors | Confluence, S3/OSS, Notion, Google Drive, JIRA, SharePoint, Salesforce, Outlook, OneDrive, Azure Blob |
| Memory | Agent dialog memory management, supports multiple rounds of context retention |
| Code execution sandbox | Python/JavaScript code executor (based on gVisor isolation) |
| Admin CLI | Command-line management tool to monitor service status |
| Python SDK | 'ragflow-sdk',Python programming interface, support programmatic management knowledge base |
3. Applicable Scenario
| Scenario | Description | Typical Customer |
|---|---|---|
| Complex Document Knowledge Base | PDF/Scanned Documents/Forms/Formula Intensive Document Q & A, DeepDoc Engine Core Scenarios | Law Firms, Financial Institutions, Accounting Firms |
| Enterprise RAG system | Multi-format document analysis, high-precision retrieval, traceability and reference, suitable for production-level deployment | Digital departments of medium and large enterprises |
| Intelligent Customer Service/FAQ ** | Multi-channel Access (Flying Book/Discord/Slack) Knowledge Base Q & A | Internet Company, E-commerce |
| Multi-Agent Collaboration System | Agent Workflow Unified Orchestration MCP Tool Integration | Enterprises that require complex business automation |
| GraphRAG multi-hop reasoning | Legal case association analysis, drug research and development literature mining and other scenarios that require entity relationship reasoning | Legal technology, biomedicine |
| Investment Research/Research Report Generation | Built-in "In-depth Analysis of Corporate Research Reports" Agent Template | Brokers, Investment Institutions |
| Legal Search/Case Analysis | Built-in "Legal Precedent Analysis" Agent Template, Structured Similar Case Analysis | Law Firms, Courts, Legal Departments |
| Manufacturing Maintenance Support | Built-in "Manufacturing Maintenance Support" template, accurate retrieval of external reference supplements from internal manuals | Manufacturing, equipment management |
4. Not quite the scene
| Scenario | Reason | Alternative Suggestions |
|---|---|---|
| plain text rapid prototyping (only a few lines of code) | RAGFlow is a platform-level solution, Docker deployment is required, and a scalpel is used to kill chickens in lightweight scenes | LlamaIndex/direct use of LangChain |
| Relying on the existing Elasticsearch infrastructure | RAGFlow uses the self-developed Infinity engine by default, and ES is only an optional backend | Haystack(ES deep integration) |
| Requires very fine-grained Pipeline code control | RAGFlow focuses on WebUI visual orchestration, and code-level flexibility is not as good as Haystack | Haystack |
| Low budget, small team, simple FAQ | High RAGFlow resource consumption (16GB RAM recommended), high cost for lightweight scenarios | MaxKB / FastGPT |
| Overseas Model Ecology Priority | RAGFlow support for Chinese models (Tongyi Thousand Questions, DeepSeek, etc.) is better than some overseas models | Dify(56 model providers) |
| Requires strong Workflow orchestration capability | RAGFlow Workflow is not as mature as Dify, and branch/loop/variable management is not as good as Dify | Dify |
| ARM64 platform deployment | ARM64 Docker images are not officially available. You need to build them yourself. | Dify (supports ARM64) |
5. Core Competence List
5.1 Document parsing capability (DeepDoc engine)
| Layout Analysis (DLR) | Transformer-based visual model that recognizes headings, paragraphs, tables, pictures, formulas, headers and footers, and multi-column layouts | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Table Structure Recognition (TSR) | Identifies header, data area, total row, row-column merge, and multi-level header. F1 > 95% | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| OCR Text Recognition | Self-developed in-depth learning OCR to recognize text in scanned documents, handwritten documents and low-quality pictures | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Formula Recognition | Transforms mathematical formulas to LaTeX format, preserving full mathematical semantics | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Mixed arrangement of pictures and texts | Identify the positional relationship between pictures and characters, and associate illustrations and pictures | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Multimodal Understanding | v0.19 + supports multimodal models to understand image content in PDF/docx | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Table of Document (TOC) Extraction | Automatically recognizes document outline structure for Long-Context RAG |
5.2 block strategy
| Policies | Applicable scenarios |
|---|---|
| General | Generic document with automatic blocking based on layout structure |
| Manual | Specify separator and block size manually |
| Q & A | Document in Q & A Pair Format |
| Table | Table-intensive documents (financial statements, etc.) |
| Paper | Academic papers, identifying abstract, chapter, reference structure |
| Book | Book, Identify Chapter, Section, Segment Levels |
| Laws | Laws and Regulations, Identifying Articles, Articles, Item Structures |
| Presentation | PPT document, divided by slide |
| One | Entire document as a single block, not cut |
| Knowledge Graph | Entity/relationship extraction for GraphRAG |
| TOC Extraction | Directory structure extraction for Long-Context RAG |
5.3 Agent / Workflow Capability
| Workflow (manual arrangement) | Visual canvas, Begin → Categorize → Retrieval → Agent → Message and other components are connected in series | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Agentic Workflow(LLM Autonomy) | LLM Autonomy Planning and Reflection (Planning Reflection), tool invocation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Multi-Agent Configuration | Orchestration of multiple agents on the same canvas. Sub-agents can be nested. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Code Executor | Python / JavaScript code execution (based on gVisor sandbox) | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Structured output | Agent output in JSON / Markdown / Word format | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Conditional Branch | Switch component, which takes different branches according to classification results | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| iteration | Iteration component, array traversal processing | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Variable aggregation | Variable Aggregator, cross-step data passing | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Await Response | Pause process, actively collect user input | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Tool Integration | 21 built-in tools MCP Server Import Academic Search | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Web Search | Agents search the Internet autonomously | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Runtime Logs | Agent Execution Path Visualization and I/O Checking |
5.4 Models and Infrastructure
| Category | Scope of Support |
|---|---|
| LLM Providers | 44: OpenAI, Anthropic, DeepSeek, Ali Tongyi Thousand Questions, Baidu Wenxin Yiyan, Google Gemini, Volcano Engine, Ollama, vLLM, Xinference, LM Studio, OpenRouter, MiniMax, Moonshot, ZhipuAI, Baichuan, LocalAI, LiteLLM, etc. |
| Embedded models | OpenAI, BGE, Jina, Cohere, Voyage 4, HuggingFace, etc., support built-in and external |
| Reorder model | Cross-encoder, BGE-Reranker, etc. |
| Vector/Search Engine | Infinity (default, self-developed), Elasticsearch (optional), Qdrant (optional), Redis (external cache) |
| VLM (Visual Language Model) | DeepDoc built-in/third-party VLM(Qwen-VL, GPT-4V, etc.) |
| TTS (Speech Synthesis) | Fish Audio |
| Inference optimization | ' |
5.5 enterprise-class features
| Multi-Workspace | Multi-Team Management, Admin/Member Role, Project-Level Data Isolation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| API token | Beta token system, access to enterprise applications | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| API compatible with OpenAI | Chat Completions API compatible with OpenAI ecosystem | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Admin management plane | Web UI management panel, graphical user management and service monitoring | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Docker deployment | Docker-compose one-click start, supports GPU acceleration | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Helm Chart | K8s deployment support | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Reverse Proxy | Support HTTPS Nginx reverse proxy | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Prometheus monitoring | Indicator exposure, access to existing monitoring system | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Multilingual UI | 10 languages including Chinese, English, Korean, Italian, Japanese, French and Arabic |
6. Architecture/deployment/integration approach
Deployment Mode
| Mode | Description | Minimum requirements |
|---|---|---|
| Docker Compose (self-hosting) | 'git clone & & docker compose up -d', most commonly used | CPU≥ 4 cores, RAM ≥ 16GB,Disk ≥ 50 GB |
| GPU Accelerated Deployment | 'DEVICE = gpu', 5-20 times faster DeepDoc parsing | NVIDIA GPU (8GB VRAM recommended) |
| SaaS Cloud Edition | 'cloud.ragflow.io', ready-to-use | No server required |
| K8s / Helm | Enterprise cluster deployment | K8s cluster |
| Start source code development | Build and run from source code, suitable for secondary development | Python ≥ 3.13 |
Deployment steps (Docker)
# 1. 确保 vm.max_map_count >= 262144
sudo sysctl -w vm.max_map_count=262144
# 2. 克隆仓库
git clone https://github.com/infiniflow/ragflow.git
cd ragflow/docker
# 3. CPU 模式启动
docker compose -f docker-compose.yml up -d
# 4. GPU 模式启动(需 NVIDIA GPU)
# sed -i '1i DEVICE=gpu' .env
# docker compose -f docker-compose.yml up -d
# 5. 检查状态
docker logs -f docker-ragflow-cpu-1
# 6. 浏览器访问 http://<服务器IP>
LLM Integration Example (configured in WebUI)
Supports access to any model through OpenAI-compatible APIs:
-Ollama:'http://host.docker.internal:11434/v1' (local model)
-vLLM:'http://vllm-server:8000/v1'
-Alibaba Cloud Tongyi Thousand Questions: DashScope API Key
-DeepSeek: by DeepSeek the API Key
-Baichuan/ZhipuAI/MiniMax / Moonshot: respective API Key access
How to use #7.
Docker deployment WebUI usage (recommended way to get started)
# 1. 部署(见上节)
git clone https://github.com/infiniflow/ragflow.git && cd ragflow/docker
docker compose -f docker-compose.yml up -d
# 2. 浏览器登录 http://localhost 或 http://<服务器IP>
# 首次登录需注册管理员账号
WebUI Usage Process:
- Configure Model Provider: Go to the "Model Provider" page and add LLM (e. g., the common qwen-plus), embedded model (e. g., BGE), and reorder model.
- Create Knowledge Base: Click "Knowledge Base" → "New", and configure the chunking strategy (such as General) and PDF parser (DeepDoc) after naming it.
- Upload Document : Drag or select PDF/Word/PPT/Excel/image file, RAGFlow automatically parse and index
- Configure Search: Select the mixed search mode (vector keyword) and set the Top-K and similarity thresholds.
- Start Dialogue : Switch to the "Dialogue" tab and enter questions to get answers with traceability.
- Build Agent: Enter the "Agent" page, use the preset template or build from the blank canvas, and drag and drop the component to arrange the process.
- Release Channel : Release Agent as Bot for channels such as flying book/Discord/Slack, or embed its own system through API
Python SDK Usage
from ragflow_sdk import RAGFlow
# 连接 RAGFlow
rag = RAGFlow(
api_key="your_api_key",
base_url="http://localhost/v1"
)
# 创建/获取知识库
dataset = rag.create_dataset(name="企业规章制度")
# 上传文档
dataset.upload_documents(["员工手册.pdf", "薪酬制度.docx"])
# 检索
chunks = dataset.retrieve(
question="年假怎么请?",
top_k=5
)
# chat
for ans in dataset.chat_stream("年假怎么请?"):
print(ans, end="")
REST API calls
curl -X POST http://localhost/v1/api/chats \
-H "Authorization: Bearer " \
-H "Content-Type: application/json" \
-d '{
"dataset_ids": [""],
"question": "请说明年假申请流程",
"stream": true
}' 8. What can I say before sales
8.1 a sentence positioning
- * "RAGFlow is the most popular RAG open source engine on GitHub in the world -- 84,000 Stars,DeepDoc the strongest in-depth document understanding capability in the industry, allowing AI to real' read' your PDF, contracts and reports. "**
8.2 customer pain points → solutions
| Customer pain points | RAGFlow solutions |
|---|---|
| "AI answers irrelevant questions after importing PDF, table data is all wrong" | DeepDoc engine : visual model accurately analyzes tables/charts/scanned documents, complex table recognition F1 > 95%, far exceeding traditional OCR |
| "There are scanned documents, pictures and formulas in the document, which RAG system cannot understand at all" | DeepDoc three-in-one :OCR + TSR (Table Structure Recognition) + DLR (Layout Analysis) Parallel Processing, One Analysis and All Restores |
| "I chose the open source RAG framework but it won't work, so I need to write a lot of code" | **WebUI visualization : what you see is what you get, upload documents → configure blocks → dialogue test, non-technical personnel can also use it |
| "Agent is required to automate business processes, not just Q & A" | Agent + Workflow Unified Orchestration: Low-code canvas drag-and-drop components, multi-agent collaboration, MCP tools, and code execution are supported |
| "Data cannot be uploaded to the cloud and must be privately deployed locally" | Docker one-click deployment: completely localized, data cannot be released from the enterprise computer room, Apache-2.0 open source has no commercial restrictions |
| "The big boss asked if there was any authoritative endorsement, why should he believe you?" | **84K Stars RAG No.1 in the world , officially recommended by Aliyun, actively developing 7,000 + Commits |
| "How to integrate your own system? Employees are used to flying books/DingTalk" | Multi-channel access : flying books/Slack/Teams + OpenAI compatible API + Python SDK + REST API |
| "I also want to use knowledge graph, but I don't know how to match RAG" | Built-in GraphRAG: Data set-level dynamic knowledge graph, automatic entity relationship construction, multi-hop reasoning |
8.3 Differentiated Selling Points
vs MaxKB(1Panel Product):
| Comparison dimension | RAGFlow | MaxKB |
|---|---|---|
| Stars | 84,059 | ~14,000 |
| Document parsing | DeepDoc visual model, accurate parsing of forms/scans | Basic OCR, limited format support |
| Retrieval Capability | Multi-way Hybrid Retrieval Fusion Reorder GraphRAG | Basic RAG Retrieval |
| Agent | Agent Workflow Unified MCP Code Orchestration Sandbox | Basic Agent (relatively simple function) |
| Enterprise | Multi-Workspace, Role Management, Admin CLI, K8s | Single Tenant, Simple Permissions |
| Positioning | Professional RAG engine, deep document understanding | Lightweight FAQ system |
| Chinese | ⭐⭐⭐⭐ | ⭐⭐ |
| Deployment complexity | Medium (Docker One-click) | Simple (Lightweight) |
vs Haystack(deepset products):
| Comparison dimension | RAGFlow | Haystack | |
|---|---|---|---|
| Stars | 84,059 | ~25,800 | |
| Locate | Platform-level RAG Engine Agent | Code-level RAG Framework | |
| Document Understanding | DeepDoc self-developed visual model, out-of-the-box use | You need to combine your own components and rely on a third party | |
| Usage | WebUI visualization is the main API | Code-only Pipeline Hayhooks deployment | |
| Low Code | ⭐⭐⭐⭐⭐Fully visualized | ⭐pure code | |
| Code Level Control | ⭐⭐⭐SDK | ⭐⭐⭐⭐⭐Pipeline can be customized | |
| Agent Capabilities | ⭐⭐⭐⭐Visual Choreography MCP | ⭐⭐⭐Tool Calling / ReAct | |
| Overseas Enterprise Trust | ⭐⭐⭐mainly chinese community | ⭐⭐⭐⭐⭐Apple/Meta/NVIDIA in Use | |
| Chinese support | ⭐⭐⭐⭐⭐Chinese Document, Chinese Community | ⭐⭐⭐English-based |
vs LlamaIndex:
| Comparison dimension | RAGFlow | LlamaIndex |
|---|---|---|
| Positioning | Complete RAG product platform | Python RAG framework library |
| Difficulty to get started | WebUI zero code | Python code required |
| Document parsing | DeepDoc engine (built-in strong) | Dependency LlamaParse (paid) |
| Flexibility | ⭐⭐⭐WebUI Templates | ⭐⭐⭐⭐⭐Full Code Control |
| productization | out-of-the-box, SaaS | need to build their own front-end and operation and maintenance |
| Stars | 84,059 | ~40,000 |
vs Dify:
| Comparison dimension | RAGFlow | Dify |
|---|---|---|
| Stars | 84,059 | ~110,000 |
| Core Advantages | Document Understanding RAG Accuracy | Workflow Orchestration Plug-in Ecology |
| Document Parsing | ⭐⭐⭐⭐⭐DeepDoc the strongest | ⭐⭐Basic format, complex document weak |
| Workflow Arrangement | ⭐⭐⭐Base branch/iteration | ⭐⭐⭐⭐⭐40 tools, complex logic |
| Model Ecology | 44 Providers | 56 Provider Plugins |
| Application Scenarios | Complex Document Knowledge Base | Generic AI Application Building |
| Deployment complexity | Docker Compose | Docker Compose |
Core differences in a word:
- You need to build a document knowledge base, process PDF/contract/scanned documents → RAGFlow (document understanding is invincible)
- You want to build complex AI applications, Workflow → Dify (more mature choreography)
-You want code level fully controllable → Haystack / LlamaIndex
-You want simple FAQ lightweight → MaxKB
8.4 Customer Value Story Line
- Cut in (cause pain) :"Have you tried to use RAG system to process contract/research report, and AI of the results mixed up all the form data?"
- Dismantling (Root Cause of Location) :"The root cause is not that LLM can't do it, but that document parsing can't do it-traditional RAG cuts PDF as plain text, tables are cut into pieces, and naturally cannot be retrieved."
- Demonstration (muscle show) : Upload a complex PDF (including scanned pages of tables and charts) on the spot → DeepDoc analysis → accurate question and answer → trace the source to the original location. This is the most powerful demo.
- Extension (from RAG to Agent) :"Not only Q & A. You can drag and drop components to build Agent-automatically read contracts → extract key terms → compare historical cases → generate analysis reports."
- Landing (dispelling worries) :"Docker is deployed to your intranet with one click, and the data does not go out. API to interface with existing systems. Flying book/DingTalk robot, direct use by employees."
- End (Trust evidence) :"84,000 developers worldwide Star RAGFlow. Alibaba Cloud official recommendation. Open source free, Apache-2.0 protocol, no risk."
9. Frequently Asked Customer Questions
| Question | Answer |
|---|---|
| What is the difference between RAGFlow and Dify? Which one should I choose? | RAGFlow is strong in document understanding and RAG retrieval accuracy, and the DeepDoc engine processing PDF/scanned documents/forms far exceeds Dify;Dify is strong in Workflow arrangement and plug-in ecology. If the core requirement is Document Knowledge Base, select RAGFlow. If the core requirement is Complex AI Application Workflow, select Dify. Both can be used in combination. |
| What is the difference between the community edition and the enterprise edition? | The open source community edition has always been the main version. The Apache-2.0 protocol is fully functional (including DeepDoc, Agent, GraphRAG, and MCP). Enterprise Edition (SaaS Cloud Edition/Enterprise Edition) provides managed deployment, exclusive support, customized SLA, BYOC deployment. For private deployment customers, the community version is fully sufficient. |
| How to ensure data security? Will data be transmitted to the outside world? | In a fully privatized deployment, all data (documents, vectors, and conversation records) are stored on the local server. When LLM is called, the RAGFlow is just the client-it sends the retrieved text fragment, not the original document. If you do not want to get out of the network even LLM, you can use the Ollama/vLLM local model to run the entire offline **. |
| What hardware is required for deployment? Is a GPU required? | Minimum CPU: 4 cores, 16GB RAM, 50GB disk. GPU is not necessary, but highly recommended-DeepDoc OCR/table recognition/layout analysis is 5-20 times faster on GPU. It is recommended to have an NVIDIA GPU(8GB VRAM) for handling scenes with many scans. |
| Does it support the localization environment? (Shin Chuang/Kirin/Ascend)? | Support Kirin OS and Ascend Chips. RAGFlow supports access to domestic models through local reasoning frameworks such as vLLM/Xinference, such as DeepSeek, Tongyi Qiwen, and Baichuan. In the letter creation environment can be full link localization. |
| What is the size of the knowledge base? What is the performance? | The Infinity engine (self-developed high-performance vector database or full-text database) is used to support millions of documents. Retrieval performance: multiple recall fusion rearrangement, millisecond response. For hyperscale scenarios, we recommend Elasticsearch backend GPU acceleration. |
| Can we connect with our existing OA/CRM system? | There are three ways: 1)REST API(OpenAI compatible format);2)Python SDK (ragflow-sdk);3) HTTP/API components in Agent are directly connected to business systems. Data source connectors support common system synchronization such as Confluence, SharePoint, and Salesforce. |
| Is deployment and O & M complex? | Docker is started Compose a command: 'docker compose up -d '. Daily operation and maintenance mainly includes image upgrade ('docker compose pull & & docker compose up -d') and disk/memory monitoring. Provides Prometheus metrics access and Admin CLI management tools. |
| What file formats can be processed? Are scanned files supported? | Word, PPT, Excel, PDF, TXT, images (PNG/JPG/JPEG), web pages, Markdown, video files, etc. Scans are perfectly supported by DeepDoc self-developed OCR-one of RAGFlow's strongest capabilities. |
| What is the open source protocol? Can it be commercially available? Is it charged? | Apache-2.0,completely free for commercial use, without any restrictions. SaaS cloud version is charged by usage (Free / Starter $29/month/Pro $129/month), which is the cost of hosting services, and there is no charge for non-open source code itself. |
10. PoC Recommendations
Recommended PoC Direction: RAG System for Complex Document Knowledge Base
| Phase | Content | Time | Output |
|---|---|---|---|
| 1. Environment Construction | Docker Deployment RAGFlow (including GPU configuration), configuration of LLM (such as Tongyi Thousand Questions) and embedded model | 0.5 Days | Runnable Environment |
| 2. Document Import | Select 50-200 real and complex documents (PDF contracts/scanned documents/reports with forms) of customers, and configure General block DeepDoc analysis | 0.5 days | Indexed Knowledge Base |
| 3. Retrieval Verification | Using Typical Business Problem Test: Mixed Retrieval (Vector Keyword Tensor) Reorder | 0.5 Days | Verify Retrieval Accuracy |
| 4. Agent construction | Agent construction based on customer scenarios (e. g. "Extract risk labels for key contract terms") | 1 day | Agent demonstration |
| 5. Docking integration | Docking customer front end or flying book/DingTalk channel through API | 1 day | Complete system that can be tested internally |
| 6. Evaluation Report | Quantitative Evaluation: Recall Rate, Accuracy Rate, Response Time; Qualitative Evaluation: User Satisfaction | 0.5 Days | PoC Evaluation Report |
Total: about 4 working days
Recommended validation metrics:
-Retrieval recall rate (including table/image content)> 85%
-complex table data question and answer accuracy> 90%
-with traceability reference answer ratio> 95%
-End-to-end average response time <5 seconds
-Scan OCR accuracy> 95%
Key to the success of PoC:
-Must use customer real complex documents-DeepDoc ability to process these documents is the core selling point
-Focus on demonstration form question and answer and scan analysis, which is the place where the gap can be widened best.
-The traceability function must be displayed-click on the answer to jump to the original document position.
-GPU acceleration should be matched well, otherwise slow DeepDoc parsing speed will affect the demonstration experience.
11. Risks and Considerations
| Risk | Level | Description | Mitigation |
|---|---|---|---|
| High resource consumption | High | GPU is recommended for minimum 16GB RAM, which is 3-5 times higher than the hardware cost of MaxKB/Lightweight solution | Clarify hardware requirements in advance; SaaS cloud version can avoid hardware investment |
| ARM64 is not supported | Medium | ARM64 Docker images are not officially provided (such as Apple Silicon Mac servers) | x86 deployment or self-built images (official build guide) |
| The community version has fast iteration and instability | Medium | A major version every January-February, with occasional Breaking Changes (such as removing full images from v0.22) | Use the stable version (tag) and test before upgrading |
| Dependent on external LLM APIs | Medium | The RAGFlow itself does not have an LLM. You need to configure an additional LLM service (API or on-premises deployment) | Configure the Ollama/vLLM local model to implement all offline services |
| Agent capability is not as good as Dify | Low | Workflow orchestration, tool ecology, and plug-in market are not as rich as Dify | Pure RAG/document scenarios are sufficient; complex Workflow consider Dify linkage |
| Team is a start-up company | Low | infiniflow is a start-up team, and the commercialization path is still being explored | Apache-2.0 protocol, community version will not disappear; Major factories such as Aliyun are already integrating and promoting |
| Community support is mainly Chinese | Low | English community is relatively small, multinational enterprises may worry that | Chinese customers are the advantage; Community is active, Discord GitHub Issues responds quickly |
| Open source competition is fierce | Low | Dify, MaxKB and other projects are also developing rapidly | RAGFlow has the highest barrier on the core track of document understanding |
12. My Pre-Sales Judgment
Recommended: Most strongly recommended (RAGFlow is preferred for customers who need a document knowledge base/complex document RAG)
Reason:
- Document understanding barrier is the highest : OCR + TSR + DLR three-in-one vision model of DeepDoc engine, complex form F1 > 95%, which is the RAGFlow's hardest moat. Other frameworks (Dify, MaxKB, Haystack) have significant gaps in this area and are difficult to catch up in the short term.
- Global Community First :84,059 Stars not only represents recognition, but also means rich community resources-rapid problem solving, best practice precipitation and rich third-party integration.
- Platform Complete : From Document Parsing → Block → Retrieval → Reordering → Generation→Agent→Channel Publishing, Full Link Coverage and Can be Operated through WebUI. Not the framework, the product.
- Enterprise Friendly :Apache-2.0 is completely free for commercial use + Docker one-click deployment + data not going out + multi-channel access + Admin management plane. It is very friendly to Chinese enterprise customers (especially the needs of Xinchuang/localization).
- Fast iteration speed :7,000 + Commits, a major version every 1-2 months. MCP, GraphRAG, Agent, Memory and other cutting-edge capabilities are quickly followed up.
- Alibaba Cloud Endorsement: Recommended by Alibaba Cloud SAE official integration, highly available instances can be deployed with one click of SAE.
Recommended Customer Persona:
-There are a large number of complex documents that need RAG processing (PDF contracts, scans, reports with tables/charts)
-Need to secure data through private deployment
-Chinese scene-based
-Requires low code/visual operations (not pure code framework)
-Limited budget but pursue RAG effect (open source free)
-There are compliance requirements for Xinchuang/localization
-Requires multi-agent collaboration MCP tool integration
Not recommended situations:
-Requires strong Workflow orchestration capability (branch/loop/variable management) → Dify
-Pure code framework, Python developers who need extreme flexibility → Haystack / LlamaIndex
-Sufficient budget and need international large factory enterprise version support → Haystack Enterprise
-Simple FAQ, lightweight requirements, low hardware budget → MaxKB / FastGPT
-ARM64 platform and unable to build images on its own
-Overseas model ecology is the core demand (Dify's 56 providers are more abundant)
13. REFERENCE
-GitHub repository: https://github.com/infiniflow/ragflow
-Official Document: https://ragflow.io/docs/dev/
-Official website: https://ragflow.io
-SaaS Cloud Edition: https://cloud.ragflow.io
-Release Notes (Release Notes):https://ragflow.io/docs/dev/release_notes
-DeepDoc technical details: https://github.com/infiniflow/ragflow/tree/main/deepdoc
-Roadmap:https://github.com/infiniflow/ragflow/issues/12241
-Discord Community: https://discord.gg/NjYzJD3GM3
-Python SDK:https://pypi.org/project/ragflow-sdk/
-Helm Chart:https://github.com/infiniflow/ragflow/tree/main/helm
-Agentic Workflow details: https://www.ragflow.io/blog/agentic-workflow-whats-inside-ragflow-v0.20.0
-Alibaba Cloud SAE deployment: Alibaba Cloud SAE supports one-click deployment of RAGFlow highly available instances
-CSDN DeepDoc Technical Analysis: https://blog.csdn.net/wayle123/article/details/159760654
- Analysis Date: 2026-07-02 | Data Aging: GitHub Metadata Pull in Real Time, Product Functions Based on v0.26.2 Official Documents and Release Notes *