RAGFlow - AI Navigation

← Back to Project List

RAGFlow is currently the RAG open source project with the highest number of Stars on GitHub (Apache-2.0,84,059 Stars,9,767 Forks), developed by the infiniflow team, and is the absolute benchmark of the global RAG track. Its core weapon is the self-developed DeepDoc depth document understanding engine -the three-in-one capability of layout analysis based on visual model, table structure recognition (TSR) and OCR, which can accurately restore tables, charts, formulas and paragraphs in complex formats such as PDF/Word/PPT/scanned documents into structured knowledge units, with complex tables F1 exceeding 95%. V0.26.0 has implemented enterprise-class capabilities such as Agent + Workflow unified orchestration, MCP full-function support, GraphRAG knowledge graph, multi-channel access, and code sandbox. It supports Docker one-click deployment or SaaS cloud version, and has perfect Chinese documents. It is the preferred open source scheme for domestic enterprises to build RAG/knowledge base system.

1. Project/Product Overview

Dimension	Information
Project name	RAGFlow
Developer	infiniflow (China Team)
Open Source License	Apache-2.0
Main Languages	Go Python (TypeScript for Web Front End)
GitHub Stars	84,059**(2026-07-02 Query, No.1 RAG Project in the World)
Forks	9,767
Commits	7,081
Open Issues	2,608
Created	2023-12-12 (about 2.5 years old)
Last Updated	2026-07-01 (Daily Active)
Latest Version	v0.26.2(2026-06-29), Total 58 Tags
official website	https://ragflow.io
Cloud Services	https://cloud.ragflow.io(Free / Starter $29/mo / Pro $129/mo / Enterprise)
Document	https://ragflow.io/docs/dev/
Community	Discord, GitHub Issues / Discussions
Topics	rag, retrieval-augmented-generation, agentic-ai, ai-agents, context-engine, llm-apps, agentic-search, context-management, agentic-retrieval

2. What does it mostly do?

The core positioning of the RAGFlow is " RAG Agent integrated context engine " -- not only does document retrieval, but also provides a set of full-link capabilities from document understanding to intelligent Agent.

Core Architecture (Six Modules)

用户/API 层  →  Chat / Agent / 搜索界面 / REST API
     ↓
Agent 编排层 →  Workflow（人工编排） + Agentic Workflow（LLM 自主规划）
     ↓
检索层     →  多路召回（向量 + 关键词 + 知识图谱） + 融合重排序
     ↓
索引层     →  Elasticsearch / Infinity（全文 + 向量混合索引）
     ↓
知识提取层  →  DeepDoc 引擎 / MinerU / Docling / OpenDataLoader
     ↓
数据源层    →  文件上传 / 数据源连接器（Confluence、S3、SharePoint、飞书等）

Main function module

Module	Capability description
DeepDoc Engine	Deep Document Understanding: OCR Table Structure Recognition (TSR) Document Layout Recognition (DLR), Three-in-One Visual Model
Templated blocking	9 blocking strategies: General, Manual, Q & A, Table, Paper, Book, Laws, Presentation, One
Multiple PDF Parsers	DeepDoc (default)/ Naive / MinerU / Docling/OpenDataLoader/3rd Party VLM
Multi-way hybrid search	Vector search BM25 Keyword search Tensor search Knowledge graph search, fusion reordering
Agent Workflow Unified Orchestration	Supports both manual Workflow and LLM Agentic Workflow on the visual canvas, including iteration, conditional branch, switch, and code executor
Full MCP Support	Import an MCP server, use an agent as an MCP client, and expose the RAGFlow itself as an MCP server.
GraphRAG	Dynamic knowledge graph construction at the dataset level, supporting entity extraction and community analysis
Long-Context RAG	Automatically generates a document directory (TOC) structure to mitigate context loss
RAPTOR	Cross-document hierarchical summary tree construction
Orchestration ingest pipeline	Visualizes the data ingest pipeline and supports custom data cleaning processes.
Multi-channel access	Flying books, Discord, Telegram, Line, Slack, Microsoft Teams, etc.
Data Source Connectors	Confluence, S3/OSS, Notion, Google Drive, JIRA, SharePoint, Salesforce, Outlook, OneDrive, Azure Blob
Memory	Agent dialog memory management, supports multiple rounds of context retention
Code execution sandbox	Python/JavaScript code executor (based on gVisor isolation)
Admin CLI	Command-line management tool to monitor service status
Python SDK	'ragflow-sdk',Python programming interface, support programmatic management knowledge base

3. Applicable Scenario

Scenario	Description	Typical Customer
Complex Document Knowledge Base	PDF/Scanned Documents/Forms/Formula Intensive Document Q & A, DeepDoc Engine Core Scenarios	Law Firms, Financial Institutions, Accounting Firms
Enterprise RAG system	Multi-format document analysis, high-precision retrieval, traceability and reference, suitable for production-level deployment	Digital departments of medium and large enterprises
Intelligent Customer Service/FAQ **	Multi-channel Access (Flying Book/Discord/Slack) Knowledge Base Q & A	Internet Company, E-commerce
Multi-Agent Collaboration System	Agent Workflow Unified Orchestration MCP Tool Integration	Enterprises that require complex business automation
GraphRAG multi-hop reasoning	Legal case association analysis, drug research and development literature mining and other scenarios that require entity relationship reasoning	Legal technology, biomedicine
Investment Research/Research Report Generation	Built-in "In-depth Analysis of Corporate Research Reports" Agent Template	Brokers, Investment Institutions
Legal Search/Case Analysis	Built-in "Legal Precedent Analysis" Agent Template, Structured Similar Case Analysis	Law Firms, Courts, Legal Departments
Manufacturing Maintenance Support	Built-in "Manufacturing Maintenance Support" template, accurate retrieval of external reference supplements from internal manuals	Manufacturing, equipment management

4. Not quite the scene

Scenario	Reason	Alternative Suggestions
plain text rapid prototyping (only a few lines of code)	RAGFlow is a platform-level solution, Docker deployment is required, and a scalpel is used to kill chickens in lightweight scenes	LlamaIndex/direct use of LangChain
Relying on the existing Elasticsearch infrastructure	RAGFlow uses the self-developed Infinity engine by default, and ES is only an optional backend	Haystack(ES deep integration)
Requires very fine-grained Pipeline code control	RAGFlow focuses on WebUI visual orchestration, and code-level flexibility is not as good as Haystack	Haystack
Low budget, small team, simple FAQ	High RAGFlow resource consumption (16GB RAM recommended), high cost for lightweight scenarios	MaxKB / FastGPT
Overseas Model Ecology Priority	RAGFlow support for Chinese models (Tongyi Thousand Questions, DeepSeek, etc.) is better than some overseas models	Dify(56 model providers)
Requires strong Workflow orchestration capability	RAGFlow Workflow is not as mature as Dify, and branch/loop/variable management is not as good as Dify	Dify
ARM64 platform deployment	ARM64 Docker images are not officially available. You need to build them yourself.	Dify (supports ARM64)

5. Core Competence List

5.1 Document parsing capability (DeepDoc engine)


Layout Analysis (DLR)	Transformer-based visual model that recognizes headings, paragraphs, tables, pictures, formulas, headers and footers, and multi-column layouts
Table Structure Recognition (TSR)	Identifies header, data area, total row, row-column merge, and multi-level header. F1 > 95%
OCR Text Recognition	Self-developed in-depth learning OCR to recognize text in scanned documents, handwritten documents and low-quality pictures
Formula Recognition	Transforms mathematical formulas to LaTeX format, preserving full mathematical semantics
Mixed arrangement of pictures and texts	Identify the positional relationship between pictures and characters, and associate illustrations and pictures
Multimodal Understanding	v0.19 + supports multimodal models to understand image content in PDF/docx
Table of Document (TOC) Extraction	Automatically recognizes document outline structure for Long-Context RAG

5.2 block strategy

Policies	Applicable scenarios
General	Generic document with automatic blocking based on layout structure
Manual	Specify separator and block size manually
Q & A	Document in Q & A Pair Format
Table	Table-intensive documents (financial statements, etc.)
Paper	Academic papers, identifying abstract, chapter, reference structure
Book	Book, Identify Chapter, Section, Segment Levels
Laws	Laws and Regulations, Identifying Articles, Articles, Item Structures
Presentation	PPT document, divided by slide
One	Entire document as a single block, not cut
Knowledge Graph	Entity/relationship extraction for GraphRAG
TOC Extraction	Directory structure extraction for Long-Context RAG

5.3 Agent / Workflow Capability


Workflow (manual arrangement)	Visual canvas, Begin → Categorize → Retrieval → Agent → Message and other components are connected in series
Agentic Workflow(LLM Autonomy)	LLM Autonomy Planning and Reflection (Planning Reflection), tool invocation
Multi-Agent Configuration	Orchestration of multiple agents on the same canvas. Sub-agents can be nested.
Code Executor	Python / JavaScript code execution (based on gVisor sandbox)
Structured output	Agent output in JSON / Markdown / Word format
Conditional Branch	Switch component, which takes different branches according to classification results
iteration	Iteration component, array traversal processing
Variable aggregation	Variable Aggregator, cross-step data passing
Await Response	Pause process, actively collect user input
Tool Integration	21 built-in tools MCP Server Import Academic Search
Web Search	Agents search the Internet autonomously
Runtime Logs	Agent Execution Path Visualization and I/O Checking

5.4 Models and Infrastructure

Category	Scope of Support
LLM Providers	44: OpenAI, Anthropic, DeepSeek, Ali Tongyi Thousand Questions, Baidu Wenxin Yiyan, Google Gemini, Volcano Engine, Ollama, vLLM, Xinference, LM Studio, OpenRouter, MiniMax, Moonshot, ZhipuAI, Baichuan, LocalAI, LiteLLM, etc.
Embedded models	OpenAI, BGE, Jina, Cohere, Voyage 4, HuggingFace, etc., support built-in and external
Reorder model	Cross-encoder, BGE-Reranker, etc.
Vector/Search Engine	Infinity (default, self-developed), Elasticsearch (optional), Qdrant (optional), Redis (external cache)
VLM (Visual Language Model)	DeepDoc built-in/third-party VLM(Qwen-VL, GPT-4V, etc.)
TTS (Speech Synthesis)	Fish Audio
Inference optimization	'' text buffering that supports inference models

5.5 enterprise-class features


Multi-Workspace	Multi-Team Management, Admin/Member Role, Project-Level Data Isolation
API token	Beta token system, access to enterprise applications
API compatible with OpenAI	Chat Completions API compatible with OpenAI ecosystem
Admin management plane	Web UI management panel, graphical user management and service monitoring
Docker deployment	Docker-compose one-click start, supports GPU acceleration
Helm Chart	K8s deployment support
Reverse Proxy	Support HTTPS Nginx reverse proxy
Prometheus monitoring	Indicator exposure, access to existing monitoring system
Multilingual UI	10 languages including Chinese, English, Korean, Italian, Japanese, French and Arabic

6. Architecture/deployment/integration approach

Deployment Mode

Mode	Description	Minimum requirements
Docker Compose (self-hosting)	'git clone & & docker compose up -d', most commonly used	CPU≥ 4 cores, RAM ≥ 16GB,Disk ≥ 50 GB
GPU Accelerated Deployment	'DEVICE = gpu', 5-20 times faster DeepDoc parsing	NVIDIA GPU (8GB VRAM recommended)
SaaS Cloud Edition	'cloud.ragflow.io', ready-to-use	No server required
K8s / Helm	Enterprise cluster deployment	K8s cluster
Start source code development	Build and run from source code, suitable for secondary development	Python ≥ 3.13

Deployment steps (Docker)

# 1. 确保 vm.max_map_count >= 262144
sudo sysctl -w vm.max_map_count=262144

# 2. 克隆仓库
git clone https://github.com/infiniflow/ragflow.git
cd ragflow/docker

# 3. CPU 模式启动
docker compose -f docker-compose.yml up -d

# 4. GPU 模式启动（需 NVIDIA GPU）
# sed -i '1i DEVICE=gpu' .env
# docker compose -f docker-compose.yml up -d

# 5. 检查状态
docker logs -f docker-ragflow-cpu-1

# 6. 浏览器访问 http://<服务器IP>

LLM Integration Example (configured in WebUI)

Supports access to any model through OpenAI-compatible APIs:

-Ollama:'http://host.docker.internal:11434/v1' (local model)

-vLLM:'http://vllm-server:8000/v1'

-Alibaba Cloud Tongyi Thousand Questions: DashScope API Key

-DeepSeek: by DeepSeek the API Key

-Baichuan/ZhipuAI/MiniMax / Moonshot: respective API Key access

How to use #7.

Docker deployment WebUI usage (recommended way to get started)

# 1. 部署（见上节）
git clone https://github.com/infiniflow/ragflow.git && cd ragflow/docker
docker compose -f docker-compose.yml up -d

# 2. 浏览器登录 http://localhost 或 http://<服务器IP>
#    首次登录需注册管理员账号

WebUI Usage Process:

Configure Model Provider: Go to the "Model Provider" page and add LLM (e. g., the common qwen-plus), embedded model (e. g., BGE), and reorder model.
Create Knowledge Base: Click "Knowledge Base" → "New", and configure the chunking strategy (such as General) and PDF parser (DeepDoc) after naming it.
Upload Document : Drag or select PDF/Word/PPT/Excel/image file, RAGFlow automatically parse and index
Configure Search: Select the mixed search mode (vector keyword) and set the Top-K and similarity thresholds.
Start Dialogue : Switch to the "Dialogue" tab and enter questions to get answers with traceability.
Build Agent: Enter the "Agent" page, use the preset template or build from the blank canvas, and drag and drop the component to arrange the process.
Release Channel : Release Agent as Bot for channels such as flying book/Discord/Slack, or embed its own system through API

Python SDK Usage

from ragflow_sdk import RAGFlow

# 连接 RAGFlow
rag = RAGFlow(
    api_key="your_api_key",
    base_url="http://localhost/v1"
)

# 创建/获取知识库
dataset = rag.create_dataset(name="企业规章制度")

# 上传文档
dataset.upload_documents(["员工手册.pdf", "薪酬制度.docx"])

# 检索
chunks = dataset.retrieve(
    question="年假怎么请？",
    top_k=5
)

# chat
for ans in dataset.chat_stream("年假怎么请？"):
    print(ans, end="")

REST API calls

curl -X POST http://localhost/v1/api/chats \
  -H "Authorization: Bearer " \
  -H "Content-Type: application/json" \
  -d '{
    "dataset_ids": [""],
    "question": "请说明年假申请流程",
    "stream": true
  }'

8. What can I say before sales

8.1 a sentence positioning

* "RAGFlow is the most popular RAG open source engine on GitHub in the world -- 84,000 Stars,DeepDoc the strongest in-depth document understanding capability in the industry, allowing AI to real' read' your PDF, contracts and reports. "**

8.2 customer pain points → solutions

Customer pain points	RAGFlow solutions
"AI answers irrelevant questions after importing PDF, table data is all wrong"	DeepDoc engine : visual model accurately analyzes tables/charts/scanned documents, complex table recognition F1 > 95%, far exceeding traditional OCR
"There are scanned documents, pictures and formulas in the document, which RAG system cannot understand at all"	DeepDoc three-in-one :OCR + TSR (Table Structure Recognition) + DLR (Layout Analysis) Parallel Processing, One Analysis and All Restores
"I chose the open source RAG framework but it won't work, so I need to write a lot of code"	**WebUI visualization : what you see is what you get, upload documents → configure blocks → dialogue test, non-technical personnel can also use it
"Agent is required to automate business processes, not just Q & A"	Agent + Workflow Unified Orchestration: Low-code canvas drag-and-drop components, multi-agent collaboration, MCP tools, and code execution are supported
"Data cannot be uploaded to the cloud and must be privately deployed locally"	Docker one-click deployment: completely localized, data cannot be released from the enterprise computer room, Apache-2.0 open source has no commercial restrictions
"The big boss asked if there was any authoritative endorsement, why should he believe you?"	**84K Stars RAG No.1 in the world , officially recommended by Aliyun, actively developing 7,000 + Commits
"How to integrate your own system? Employees are used to flying books/DingTalk"	Multi-channel access : flying books/Slack/Teams + OpenAI compatible API + Python SDK + REST API
"I also want to use knowledge graph, but I don't know how to match RAG"	Built-in GraphRAG: Data set-level dynamic knowledge graph, automatic entity relationship construction, multi-hop reasoning

8.3 Differentiated Selling Points

vs MaxKB(1Panel Product):

Comparison dimension	RAGFlow	MaxKB
Stars	84,059	~14,000
Document parsing	DeepDoc visual model, accurate parsing of forms/scans	Basic OCR, limited format support
Retrieval Capability	Multi-way Hybrid Retrieval Fusion Reorder GraphRAG	Basic RAG Retrieval
Agent	Agent Workflow Unified MCP Code Orchestration Sandbox	Basic Agent (relatively simple function)
Enterprise	Multi-Workspace, Role Management, Admin CLI, K8s	Single Tenant, Simple Permissions
Positioning	Professional RAG engine, deep document understanding	Lightweight FAQ system
Chinese	⭐⭐⭐⭐	⭐⭐
Deployment complexity	Medium (Docker One-click)	Simple (Lightweight)

vs Haystack(deepset products):

Comparison dimension	RAGFlow	Haystack
Stars	84,059	~25,800
Locate	Platform-level RAG Engine Agent	Code-level RAG Framework
Document Understanding	DeepDoc self-developed visual model, out-of-the-box use	You need to combine your own components and rely on a third party
Usage	WebUI visualization is the main API	Code-only Pipeline Hayhooks deployment
Low Code	⭐⭐⭐⭐⭐Fully visualized	⭐pure code
Code Level Control	⭐⭐⭐SDK	⭐⭐⭐⭐⭐Pipeline can be customized
	Agent Capabilities	⭐⭐⭐⭐Visual Choreography MCP	⭐⭐⭐Tool Calling / ReAct
Overseas Enterprise Trust	⭐⭐⭐mainly chinese community	⭐⭐⭐⭐⭐Apple/Meta/NVIDIA in Use
Chinese support	⭐⭐⭐⭐⭐Chinese Document, Chinese Community	⭐⭐⭐English-based

vs LlamaIndex:

Comparison dimension	RAGFlow	LlamaIndex
Positioning	Complete RAG product platform	Python RAG framework library
Difficulty to get started	WebUI zero code	Python code required
Document parsing	DeepDoc engine (built-in strong)	Dependency LlamaParse (paid)
Flexibility	⭐⭐⭐WebUI Templates	⭐⭐⭐⭐⭐Full Code Control
productization	out-of-the-box, SaaS	need to build their own front-end and operation and maintenance
Stars	84,059	~40,000

vs Dify:

Comparison dimension	RAGFlow	Dify
Stars	84,059	~110,000
Core Advantages	Document Understanding RAG Accuracy	Workflow Orchestration Plug-in Ecology
Document Parsing	⭐⭐⭐⭐⭐DeepDoc the strongest	⭐⭐Basic format, complex document weak
Workflow Arrangement	⭐⭐⭐Base branch/iteration	⭐⭐⭐⭐⭐40 tools, complex logic
Model Ecology	44 Providers	56 Provider Plugins
Application Scenarios	Complex Document Knowledge Base	Generic AI Application Building
Deployment complexity	Docker Compose	Docker Compose

Core differences in a word:

- You need to build a document knowledge base, process PDF/contract/scanned documents → RAGFlow (document understanding is invincible)

- You want to build complex AI applications, Workflow → Dify (more mature choreography)

-You want code level fully controllable → Haystack / LlamaIndex

-You want simple FAQ lightweight → MaxKB

8.4 Customer Value Story Line

Cut in (cause pain) :"Have you tried to use RAG system to process contract/research report, and AI of the results mixed up all the form data?"
Dismantling (Root Cause of Location) :"The root cause is not that LLM can't do it, but that document parsing can't do it-traditional RAG cuts PDF as plain text, tables are cut into pieces, and naturally cannot be retrieved."
Demonstration (muscle show) : Upload a complex PDF (including scanned pages of tables and charts) on the spot → DeepDoc analysis → accurate question and answer → trace the source to the original location. This is the most powerful demo.
Extension (from RAG to Agent) :"Not only Q & A. You can drag and drop components to build Agent-automatically read contracts → extract key terms → compare historical cases → generate analysis reports."
Landing (dispelling worries) :"Docker is deployed to your intranet with one click, and the data does not go out. API to interface with existing systems. Flying book/DingTalk robot, direct use by employees."
End (Trust evidence) :"84,000 developers worldwide Star RAGFlow. Alibaba Cloud official recommendation. Open source free, Apache-2.0 protocol, no risk."

9. Frequently Asked Customer Questions

Question	Answer
What is the difference between RAGFlow and Dify? Which one should I choose?	RAGFlow is strong in document understanding and RAG retrieval accuracy, and the DeepDoc engine processing PDF/scanned documents/forms far exceeds Dify;Dify is strong in Workflow arrangement and plug-in ecology. If the core requirement is Document Knowledge Base, select RAGFlow. If the core requirement is Complex AI Application Workflow, select Dify. Both can be used in combination.
What is the difference between the community edition and the enterprise edition?	The open source community edition has always been the main version. The Apache-2.0 protocol is fully functional (including DeepDoc, Agent, GraphRAG, and MCP). Enterprise Edition (SaaS Cloud Edition/Enterprise Edition) provides managed deployment, exclusive support, customized SLA, BYOC deployment. For private deployment customers, the community version is fully sufficient.
How to ensure data security? Will data be transmitted to the outside world?	In a fully privatized deployment, all data (documents, vectors, and conversation records) are stored on the local server. When LLM is called, the RAGFlow is just the client-it sends the retrieved text fragment, not the original document. If you do not want to get out of the network even LLM, you can use the Ollama/vLLM local model to run the entire offline **.
What hardware is required for deployment? Is a GPU required?	Minimum CPU: 4 cores, 16GB RAM, 50GB disk. GPU is not necessary, but highly recommended-DeepDoc OCR/table recognition/layout analysis is 5-20 times faster on GPU. It is recommended to have an NVIDIA GPU(8GB VRAM) for handling scenes with many scans.
Does it support the localization environment? (Shin Chuang/Kirin/Ascend)?	Support Kirin OS and Ascend Chips. RAGFlow supports access to domestic models through local reasoning frameworks such as vLLM/Xinference, such as DeepSeek, Tongyi Qiwen, and Baichuan. In the letter creation environment can be full link localization.
What is the size of the knowledge base? What is the performance?	The Infinity engine (self-developed high-performance vector database or full-text database) is used to support millions of documents. Retrieval performance: multiple recall fusion rearrangement, millisecond response. For hyperscale scenarios, we recommend Elasticsearch backend GPU acceleration.
Can we connect with our existing OA/CRM system?	There are three ways: 1)REST API(OpenAI compatible format);2)Python SDK (ragflow-sdk);3) HTTP/API components in Agent are directly connected to business systems. Data source connectors support common system synchronization such as Confluence, SharePoint, and Salesforce.
Is deployment and O & M complex?	Docker is started Compose a command: 'docker compose up -d '. Daily operation and maintenance mainly includes image upgrade ('docker compose pull & & docker compose up -d') and disk/memory monitoring. Provides Prometheus metrics access and Admin CLI management tools.
What file formats can be processed? Are scanned files supported?	Word, PPT, Excel, PDF, TXT, images (PNG/JPG/JPEG), web pages, Markdown, video files, etc. Scans are perfectly supported by DeepDoc self-developed OCR-one of RAGFlow's strongest capabilities.
What is the open source protocol? Can it be commercially available? Is it charged?	Apache-2.0,completely free for commercial use, without any restrictions. SaaS cloud version is charged by usage (Free / Starter $29/month/Pro $129/month), which is the cost of hosting services, and there is no charge for non-open source code itself.

10. PoC Recommendations

Recommended PoC Direction: RAG System for Complex Document Knowledge Base

Phase	Content	Time	Output
1. Environment Construction	Docker Deployment RAGFlow (including GPU configuration), configuration of LLM (such as Tongyi Thousand Questions) and embedded model	0.5 Days	Runnable Environment
2. Document Import	Select 50-200 real and complex documents (PDF contracts/scanned documents/reports with forms) of customers, and configure General block DeepDoc analysis	0.5 days	Indexed Knowledge Base
3. Retrieval Verification	Using Typical Business Problem Test: Mixed Retrieval (Vector Keyword Tensor) Reorder	0.5 Days	Verify Retrieval Accuracy
4. Agent construction	Agent construction based on customer scenarios (e. g. "Extract risk labels for key contract terms")	1 day	Agent demonstration
5. Docking integration	Docking customer front end or flying book/DingTalk channel through API	1 day	Complete system that can be tested internally
6. Evaluation Report	Quantitative Evaluation: Recall Rate, Accuracy Rate, Response Time; Qualitative Evaluation: User Satisfaction	0.5 Days	PoC Evaluation Report

Total: about 4 working days

Recommended validation metrics:

-Retrieval recall rate (including table/image content)> 85%

-complex table data question and answer accuracy> 90%

-with traceability reference answer ratio> 95%

-End-to-end average response time <5 seconds

-Scan OCR accuracy> 95%

Key to the success of PoC:

-Must use customer real complex documents-DeepDoc ability to process these documents is the core selling point

-Focus on demonstration form question and answer and scan analysis, which is the place where the gap can be widened best.

-The traceability function must be displayed-click on the answer to jump to the original document position.

-GPU acceleration should be matched well, otherwise slow DeepDoc parsing speed will affect the demonstration experience.

11. Risks and Considerations

Risk	Level	Description	Mitigation
High resource consumption	High	GPU is recommended for minimum 16GB RAM, which is 3-5 times higher than the hardware cost of MaxKB/Lightweight solution	Clarify hardware requirements in advance; SaaS cloud version can avoid hardware investment
ARM64 is not supported	Medium	ARM64 Docker images are not officially provided (such as Apple Silicon Mac servers)	x86 deployment or self-built images (official build guide)
The community version has fast iteration and instability	Medium	A major version every January-February, with occasional Breaking Changes (such as removing full images from v0.22)	Use the stable version (tag) and test before upgrading
Dependent on external LLM APIs	Medium	The RAGFlow itself does not have an LLM. You need to configure an additional LLM service (API or on-premises deployment)	Configure the Ollama/vLLM local model to implement all offline services
Agent capability is not as good as Dify	Low	Workflow orchestration, tool ecology, and plug-in market are not as rich as Dify	Pure RAG/document scenarios are sufficient; complex Workflow consider Dify linkage
Team is a start-up company	Low	infiniflow is a start-up team, and the commercialization path is still being explored	Apache-2.0 protocol, community version will not disappear; Major factories such as Aliyun are already integrating and promoting
Community support is mainly Chinese	Low	English community is relatively small, multinational enterprises may worry that	Chinese customers are the advantage; Community is active, Discord GitHub Issues responds quickly
Open source competition is fierce	Low	Dify, MaxKB and other projects are also developing rapidly	RAGFlow has the highest barrier on the core track of document understanding

12. My Pre-Sales Judgment

Recommended: Most strongly recommended (RAGFlow is preferred for customers who need a document knowledge base/complex document RAG)

Reason:

Document understanding barrier is the highest : OCR + TSR + DLR three-in-one vision model of DeepDoc engine, complex form F1 > 95%, which is the RAGFlow's hardest moat. Other frameworks (Dify, MaxKB, Haystack) have significant gaps in this area and are difficult to catch up in the short term.
Global Community First :84,059 Stars not only represents recognition, but also means rich community resources-rapid problem solving, best practice precipitation and rich third-party integration.
Platform Complete : From Document Parsing → Block → Retrieval → Reordering → Generation→Agent→Channel Publishing, Full Link Coverage and Can be Operated through WebUI. Not the framework, the product.
Enterprise Friendly :Apache-2.0 is completely free for commercial use + Docker one-click deployment + data not going out + multi-channel access + Admin management plane. It is very friendly to Chinese enterprise customers (especially the needs of Xinchuang/localization).
Fast iteration speed :7,000 + Commits, a major version every 1-2 months. MCP, GraphRAG, Agent, Memory and other cutting-edge capabilities are quickly followed up.
Alibaba Cloud Endorsement: Recommended by Alibaba Cloud SAE official integration, highly available instances can be deployed with one click of SAE.

Recommended Customer Persona:

-There are a large number of complex documents that need RAG processing (PDF contracts, scans, reports with tables/charts)

-Need to secure data through private deployment

-Chinese scene-based

-Requires low code/visual operations (not pure code framework)

-Limited budget but pursue RAG effect (open source free)

-There are compliance requirements for Xinchuang/localization

-Requires multi-agent collaboration MCP tool integration

Not recommended situations:

-Requires strong Workflow orchestration capability (branch/loop/variable management) → Dify

-Pure code framework, Python developers who need extreme flexibility → Haystack / LlamaIndex

-Sufficient budget and need international large factory enterprise version support → Haystack Enterprise

-Simple FAQ, lightweight requirements, low hardware budget → MaxKB / FastGPT

-ARM64 platform and unable to build images on its own

-Overseas model ecology is the core demand (Dify's 56 providers are more abundant)

13. REFERENCE

-GitHub repository: https://github.com/infiniflow/ragflow

-Official Document: https://ragflow.io/docs/dev/

-Official website: https://ragflow.io

-SaaS Cloud Edition: https://cloud.ragflow.io

-Release Notes (Release Notes):https://ragflow.io/docs/dev/release_notes

-DeepDoc technical details: https://github.com/infiniflow/ragflow/tree/main/deepdoc

-Roadmap:https://github.com/infiniflow/ragflow/issues/12241

-Discord Community: https://discord.gg/NjYzJD3GM3

-Python SDK:https://pypi.org/project/ragflow-sdk/

-Helm Chart:https://github.com/infiniflow/ragflow/tree/main/helm

-Agentic Workflow details: https://www.ragflow.io/blog/agentic-workflow-whats-inside-ragflow-v0.20.0

-Alibaba Cloud SAE deployment: Alibaba Cloud SAE supports one-click deployment of RAGFlow highly available instances

-CSDN DeepDoc Technical Analysis: https://blog.csdn.net/wayle123/article/details/159760654

Analysis Date: 2026-07-02 | Data Aging: GitHub Metadata Pull in Real Time, Product Functions Based on v0.26.2 Official Documents and Release Notes *