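Popular GitHub AI Project Repositories Related to RAG
Discover the most popular open-source projects and tools related to RAG, and learn about the latest development trends and innovations.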
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
infiniflow

LlamaIndex is the leading framework for building LLM-powered agents over your data.
run-llama

Langchain-Chatchat (formerly Langchain-ChatGLM): a RAG and Agent application for local knowledge bases, built on Langchain and language models such as ChatGLM, Qwen, and Llama.
chatchat-space

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
milvus-io

A modular graph-based Retrieval-Augmented Generation (RAG) system
microsoft

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
pathwaycom

An open-source RAG-based tool for chatting with your documents.
Cinnamon

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
deepset-ai

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
sinaptik-ai

Crawlee: a web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
apify

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
vanna-ai

Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end-to-end problems using the Llama model family and using them on various provider services.
meta-llama

💬 MaxKB is an open-source AI assistant for enterprise. It seamlessly integrates RAG pipelines, supports robust workflows, and provides MCP tool-use capabilities.
1Panel-dev

AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
eosphoros-ai

"LightRAG: Simple and Fast Retrieval-Augmented Generation"
HKUDS

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, LLM and tools tracing, debugging multi-agentic systems, a self-hosted dashboard and advanced analytics with timeline and execution graph view
raga-ai-hub

DocsGPT is an open-source genAI tool that helps users get reliable answers from knowledge sources, while avoiding hallucinations. It enables private and reliable information retrieval, with tooling and agentic system capability built in.
arc53

Unified framework for building enterprise RAG pipelines with small, specialized models
llmware-ai

The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.
mastra-ai

Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
onyx-dot-app

💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
neuml

🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.
oramasearch

In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
patchy631

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
QwenLM

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
dataelement

This project is a tutorial on LLM application development aimed at beginner developers. Read it online at: https://datawhalechina.github.io/llm-universe/
datawhalechina

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
comet-ml

Private & local AI personal knowledge management app for high entropy people.
reorproject

🤖 Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, dashboards, BI and embedded AI. 📈📊📋🧑💻
Canner

High accuracy RAG for answering questions from scientific documents with citations
Future-House

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
weaviate

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge bases. It can effectively overcome the shortcomings of the traditional RAG vector similarity calculation model.
OpenSPG

SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
SciPhi-AI

All-in-one LLM CLI tool featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.
sigoden

Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
promptfoo

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
zilliztech

TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaking, and is fully compatible with platforms like Dify and Coze.
TEN-framework

Crawlee: a web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
apify

The open source platform for AI-native application development.
TaskingAI

Superduper: End-to-end framework for building custom AI applications and agents.
superduper-io

A visual playground for agentic workflows: Iterate over your agents 10x faster
PySpur-Dev

A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL
timescale

Prompt-To-Agent : Create custom engineering agents for your codebase
potpie-ai

Data processing and instruction calling with ML, LLM and Vision LLM
katanaml

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
adbar

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
truefoundry

Open Source Alternative to NotebookLM / Perplexity / Glean, connected to external sources such as search engines (Tavily, Linkup), Slack, Linear, Notion, YouTube, GitHub and more.
MODSetter

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Marker-Inc-Korea

🤖 Learn for free how to build an end-to-end production-ready LLM & RAG system using LLMOps best practices: ~ source code + 12 hands-on lessons
decodingml

🎨 Refly is an open-source AI-native creation engine. Its intuitive free-form canvas interface combines multi-threaded dialogues, artifacts, AI knowledge base integration, chrome extension clip & save, contextual memory, intelligent search, WYSIWYG AI editor and more, empowering you to effortlessly transform ideas into production-ready content.
refly-ai

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.
gptme

What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
humanlayer

Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai
FellouAI

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text
infiniflow

Interact with your SQL database, Natural Language to SQL using LLMs
Dataherald

Neo4j graph construction from unstructured data using LLMs
neo4j-labs

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
AnswerDotAI

The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
PacktPublishing

RAG that intelligently adapts to your use case, data, and queries
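circlemind-ai

Streamer-Sales (销冠, "Sales Champion"): an LLM for livestream sales hosts 🛒🎁 that presents a product based on its given features, from the angle of stimulating the user's desire to buy. 🚀⭐ Includes a detailed data generation workflow ❗ 📦 Also integrates LMDeploy for accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, ASR (speech-to-text) 🎙️, a frontend built on the Vue ecosystem 🍍, a FastAPI backend 🗝️, and Docker-compose packaging and deployment 🐋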
PeterH0323

ModelScope-Agent: An agent framework connecting models in ModelScope with the world
modelscope

Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
cohere-ai

AdalFlow: The library to build & auto-optimize LLM applications.
SylphAI-Inc

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
NVIDIA

A simple, secure and modern file encryption tool (and Rust library) with small explicit keys, no config options, and UNIX-style composability.
str4d

Everything you need to know to build your own RAG application
BragAI

AI Search & RAG Without Moving Your Data. Get instant answers from your company's knowledge across 100+ apps while keeping data secure. Deploy in minutes, not months.
swirlai














