Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

llama-cpp-python

Here are 48 public repositories matching this topic...

biniou

Setup and run a local LLM and Chatbot using consumer grade hardware.

  • UpdatedOct 28, 2025
  • JavaScript

Information on optimizing python libraries specifically for oobabooga to take advantage of Apple Silicon and Accelerate Framework.

  • UpdatedFeb 12, 2025
  • Python

An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intelligent assistant for modern professionals.

  • UpdatedAug 1, 2024
  • Python

Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma

  • UpdatedOct 7, 2024
  • Python

GPU-accelerated LLaMA inference wrapper for legacy Vulkan-capable systems a Pythonic way to run AI with knowledge (Ilm) on fire (Vulkan).

  • UpdatedOct 14, 2025
  • Python

UnOfficial Gradio Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.

  • UpdatedSep 30, 2024
  • Jupyter Notebook

A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more models from hf repos and more. It's super easy to use and comes prepacked with best preconfigured open source models: dolphin phi-2 2.7b, mistral 7b v0.2, mixtral 8x7b v0.1, solar 10.7b and zephyr 3b

  • UpdatedJan 13, 2024
  • Python

A financial chatbot powered by an LLM and retrieval-augmented generation.

  • UpdatedOct 2, 2023
  • Jupyter Notebook

Simple chat interface for local AI using llama-cpp-python and llama-cpp-agent

  • UpdatedJul 27, 2024
  • Python

SOLAIRIA is a free tool with minimal dependencies that lets you interact with text-generation AI LLMs of your choice privately by running on your own local hardware offline.

  • UpdatedJun 24, 2025
  • Python

This is a adaptive RAG system decoupled ready for deployment and production

  • UpdatedSep 28, 2025
  • Python

A Genshin Impact Question Answer Project supported by Qwen1.5-14B-Chat

  • UpdatedJun 4, 2024
  • Python
runpod-llm

Runpod-LLM provides ready-to-use container scripts for running large language models (LLMs) easily on RunPod.

  • UpdatedMay 20, 2025
  • Shell

Improve this page

Add a description, image, and links to thellama-cpp-python topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thellama-cpp-python topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp