llama-cpp-python
Here are 48 public repositories matching this topic...
a self-hosted webui for 30+ generative ai
Updated Nov 5, 2025 - Python
Setup and run a local LLM and Chatbot using consumer grade hardware.
Updated Oct 28, 2025 - JavaScript
Gradio-based tool to run open-source LLMs directly from Hugging Face
Updated Jun 27, 2024 - Python
An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intelligent assistant for modern professionals.
Updated Aug 1, 2024 - Python
Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma
Updated Oct 7, 2024 - Python
GPU-accelerated LLaMA inference wrapper for legacy Vulkan-capable systems: a Pythonic way to run AI with knowledge (Ilm) on fire (Vulkan).
Updated Oct 14, 2025 - Python
Tool for testing different large language models without code.
Updated Oct 18, 2025 - Python
Experimental interface environment for open source LLM, designed to democratize the use of AI. Powered by llama-cpp, llama-cpp-python and Gradio.
Updated Oct 11, 2025 - Python
Unofficial Gradio repo for the ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
Updated Sep 30, 2024 - Jupyter Notebook
A quick and optimized solution to manage llama-based gguf quantized models, download gguf files, retrieve message formatting, add more models from hf repos and more. It's super easy to use and comes prepackaged with the best preconfigured open source models: dolphin phi-2 2.7b, mistral 7b v0.2, mixtral 8x7b v0.1, solar 10.7b and zephyr 3b.
Updated Jan 13, 2024 - Python
A financial chatbot powered by an LLM and retrieval-augmented generation.
Updated Oct 2, 2023 - Jupyter Notebook
Simple chat interface for local AI using llama-cpp-python and llama-cpp-agent
Updated Jul 27, 2024 - Python
TAO71 I4.0 is an AI created by TAO71 in Python.
Updated Aug 21, 2025 - Python
YouTube API implementation with Meta's Llama 2 to analyze comments and sentiments
Updated Dec 5, 2023 - Python
SOLAIRIA is a free tool with minimal dependencies that lets you interact with text-generation AI LLMs of your choice privately by running on your own local hardware offline.
Updated Jun 24, 2025 - Python
This is an adaptive, decoupled RAG system ready for deployment and production.
Updated Sep 28, 2025 - Python
A Genshin Impact Question Answer Project supported by Qwen1.5-14B-Chat
Updated Jun 4, 2024 - Python
Runpod-LLM provides ready-to-use container scripts for running large language models (LLMs) easily on RunPod.
Updated May 20, 2025 - Shell
Clippy resurrected as an AI front end. 📃📎👀
Updated Sep 24, 2025 - Python