llamacpp
Here are 632 public repositories matching this topic, across all languages, sorted by most stars.
Jan is an open-source alternative to ChatGPT that runs 100% offline on your computer.
- Updated Dec 17, 2025 - TypeScript
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
- Updated Dec 8, 2025 - Python
Unified framework for building enterprise RAG pipelines with small, specialized models
- Updated Jul 24, 2025 - Python
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
- Updated Dec 17, 2025 - Python
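Not taken from the Xinference documentation, but a minimal sketch of what the "single line" swap typically looks like when a local server exposes an OpenAI-compatible API: you keep the standard OpenAI client and only change the base URL. The port, API key placeholder, and model name below are assumptions for illustration.

```python
# Minimal sketch: point the standard OpenAI Python client at a local
# OpenAI-compatible endpoint instead of api.openai.com.
# The base_url, api_key placeholder, and model name are assumptions,
# not values taken from the Xinference documentation.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:9997/v1", api_key="not-needed")

resp = client.chat.completions.create(
    model="qwen2.5-instruct",  # whichever model the local server is actually serving
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(resp.choices[0].message.content)
```

The only line that differs from a hosted-GPT setup is the client construction with a local base_url, which is what "swap GPT for any LLM by changing a single line of code" refers to.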
Private & local AI personal knowledge management app for high entropy people.
- Updated May 13, 2025 - JavaScript
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy-to-use API.
- Updated Nov 21, 2025 - Svelte
State of the Art Natural Language Processing
- Updated Dec 17, 2025 - Scala
Kernels & AI inference engine for mobile devices.
- Updated Dec 17, 2025 - C++
Production-ready toolkit to run AI locally
- Updated Dec 17, 2025 - Kotlin
The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but 100% free.
- Updated Aug 7, 2025 - TypeScript
⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.
- Updated Dec 17, 2025 - Rust
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
- Updated Dec 16, 2025 - Python
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.
- Updated Jan 7, 2025 - Rust
RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.
- Updated Dec 17, 2025 - Python
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
- Updated Jul 28, 2025 - Dart
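Maid itself is a Flutter/Dart app, so the snippet below is not its code; it is only a minimal sketch of what local GGUF inference through llama.cpp looks like, assuming the llama-cpp-python bindings are installed and a GGUF file has already been downloaded. The model path and parameters are placeholders.

```python
# Minimal sketch of local GGUF inference through llama.cpp, assuming the
# llama-cpp-python bindings are installed (pip install llama-cpp-python).
# The model path and parameters are placeholders, not values from Maid.
from llama_cpp import Llama

llm = Llama(
    model_path="./models/llama-3-8b-instruct.Q4_K_M.gguf",  # hypothetical local GGUF file
    n_ctx=4096,    # context window size
    n_threads=8,   # CPU threads to use for generation
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain what a GGUF file is in one sentence."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```

Apps in this list such as Maid wrap this same llama.cpp loading-and-prompting flow behind a chat UI instead of exposing it as script code.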