vector-database-embedding
Here are 22 public repositories matching this topic...
Language:All
Sort:Most stars
🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.
- Updated
Jul 12, 2025 - TypeScript
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
- Updated
Apr 15, 2025 - TypeScript
A Python vector database you just need - no more, no less.
- Updated
Mar 4, 2024 - Python
A Python library to chunk/group your texts based on semantic similarity.
- Updated
Jul 11, 2024 - Python
S3 vector database for LLM Agents and RAG.
- Updated
Aug 15, 2023 - Python
Embed anything.
- Updated
May 24, 2024 - JavaScript
V3CTRON | Vector Embeddings Data Retrieval | ChatGPT Plugin
- Updated
Oct 18, 2023 - Python
Examples of vector DB indexing and query with various vector databases.
- Updated
Feb 12, 2025 - Python
MIRA is a little brain-in-a-box. It is able to recognize its own shortcomings and adapt over time to preempt them. It learns facts from conversation during a daily consolidation task and surfaces them nautrally in conversation. This repository includes supporting files that allow you to zero-shot new tools for MIRA. Works 100% well offline too.
- Updated
Jul 12, 2025 - Python
High-performance database management system
- Updated
Jul 10, 2025 - Python
Machine Learning, LLM and other Jupyter Notebooks and resources
- Updated
Jun 24, 2025 - Jupyter Notebook
Scalable API extension for advanced vector database functions. Enhance machine learning, search, and analytics applications with an API that supports efficient embedding storage and similarity searches.
- Updated
Aug 20, 2024 - TypeScript
Complete pipeline for generating DBpedia text embeddings using OpenAI's embedding models and publishing them as Hugging Face datasets.
- Updated
Jul 11, 2025 - Python
Experimenting with Pinecone as vector data continues to take center stage in AI-native systems. The purpose of this project is to explore the core capabilities, benchmark performance across different embedding models, and better understand what is possible with vector search in production environments.
- Updated
Jun 30, 2025 - Python
Create a ChatGPT-like experience with your data.
- Updated
Jul 4, 2023 - Jupyter Notebook
ToucanDB is a brand-new micro ML-first database engine 🦜
- Updated
Jul 17, 2025 - Python
A web app that uses Retrieval-Augmented Generation (RAG) to create an AI expert over a codebase. The app allows users to interact with a codebase via chat, retrieving relevant code snippets from a Pinecone vector database and generating responses using LLMs.
- Updated
Dec 7, 2024 - Jupyter Notebook
This repository contains source code which encompasses usage of the Langchain framework to extract information from distinct types of documents and subsequently perform Retrieval Augmented Generation(RAG) on these documents as well.
- Updated
Nov 26, 2024 - Jupyter Notebook
End-to-End Research Bot for Summarizing and Extracting Insights from Multiple URLs using advanced text processing, FAISS vector storage, and OpenAI services for accurate and concise responses.
- Updated
Feb 7, 2025 - Jupyter Notebook
Improve this page
Add a description, image, and links to thevector-database-embedding topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thevector-database-embedding topic, visit your repo's landing page and select "manage topics."