rag-pipeline
Here are 633 public repositories matching this topic...
Language:All
Sort:Most stars
A minimal Agentic RAG built with LangGraph — learn Retrieval-Augmented Generation Agents in minutes.
- Updated
Feb 20, 2026 - Jupyter Notebook
Demystify RAG by building it from scratch. Local LLMs, no black boxes - real understanding of embeddings, vector search, retrieval, and context-augmented generation.
- Updated
Dec 4, 2025 - JavaScript
High-accuracy PDF-to-Markdown OCR API using LLMs with vision capabilities. Features parallel processing, batching, and auto-retry logic for scalable extraction.
- Updated
Nov 29, 2025 - Python
A RAG pipeline implementation built on the 'Epstein Files 20K' dataset from Hugging Face (Teyler).
- Updated
Feb 14, 2026 - Python
HiveMind Protocol - A Local-First, Privacy-Preserving Architecture for Agentic RAG
- Updated
Dec 1, 2025 - TypeScript
Move from idea to production in hours with policy-driven autonomous AI agents. Unified Control Plane: Centralised tools, MCPs, models, data, and policies with consistent observability and governance.
- Updated
Feb 20, 2026 - Python
PDFStract - The Extraction and Chunking Layer in Your RAG Pipeline - Available as CLI - WEBUI - API
- Updated
Feb 14, 2026 - Python
A scalable RAG platform combining LangGraph agents, hybrid retrieval (Vector+Graph), and Ray orchestration on Kubernetes.
- Updated
Dec 27, 2025 - Python
A Python CLI to test, benchmark, and find the best RAG chunking strategy for your Markdown documents.
- Updated
Jan 18, 2026 - Python
We believe that every SOTA result is only valid on its own dataset. RAGView provides a unified evaluation platform to benchmark different RAG methods on your specific data.
- Updated
Dec 5, 2025
RAG boilerplate with semantic/propositional chunking, hybrid search (BM25 + dense), LLM reranking, query enhancement agents, CrewAI orchestration, Qdrant vector search, Redis/Mongo sessioning, Celery ingestion pipeline, Gradio UI, and an evaluation suite (Hit-Rate, MRR, hybrid configs).
- Updated
Nov 18, 2025 - Python
Agent Fusion is a local RAG semantic search engine that gives AI agents instant access to your code, documentation (Markdown, Word, PDF). Query your codebase from code agents without hallucinations. Runs 100% locally, includes a lightweight embedding model, and optional multi-agent task orchestration. Deploy with a single JAR
- Updated
Nov 24, 2025 - Kotlin
An in-memory Vector Database & AI Gateway written in Go. Supports HNSW, Hybrid Search (BM25), GraphRAG context, a built-in RAG Pipeline, and can be embedded directly into your apps.
- Updated
Feb 20, 2026 - Go
CrawlAI RAG is an AI-powered website intelligence platform that allows users to crawl entire websites, index their content, and ask natural-language questions using Retrieval-Augmented Generation (RAG). It transforms static websites into queryable knowledge bases.
- Updated
Feb 15, 2026 - Python
Generate & Ship UI with minimal effort - Open Source Generative UI with natural language
- Updated
Apr 30, 2025 - TypeScript
Quickest way to production grade RAG UI.
- Updated
Feb 16, 2026 - TypeScript
Modular full-stack ML project leveraging Groq API, Streamlit, Supabase, JSON, SciPy, SciKit-Learn, Plotly & EmailJS, alongside libraries - NumPy, Pandas, Utils, OS, Base64, Re, Pillow & DateTime.
- Updated
Feb 9, 2026 - Jupyter Notebook
AI-powered mock interview platform using Next.js, Gemini AI, JSON, Drizzle, NeonDB, API routes and Clerk for dynamic questions, feedback & session recording, plus Dockerized & deployed microservices.
- Updated
Feb 9, 2026 - JavaScript
A python module library that simplifies RAG through abstraction
- Updated
Nov 25, 2025 - Python
Improve this page
Add a description, image, and links to therag-pipeline topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with therag-pipeline topic, visit your repo's landing page and select "manage topics."