
llama-cpp

Here are 143 public repositories matching this topic...

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

  • Updated Apr 23, 2024
  • TypeScript

A C#/.NET library to run LLMs (🦙 LLaMA/LLaVA) efficiently on your local device.

  • Updated Jul 11, 2025
  • C#
maid

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

  • Updated May 28, 2025
  • Dart
node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level

  • Updated Jun 12, 2025
  • TypeScript

llama.go is like llama.cpp in pure Golang!

  • Updated Sep 20, 2024
  • Go

prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters

  • Updated Jul 18, 2025
  • C++

Self-evaluating interview for AI coders

  • Updated Jun 21, 2025
  • Python

React Native binding of llama.cpp

  • Updated Jul 9, 2025
  • C

llama.cpp Rust bindings

  • Updated Jun 27, 2024
  • Rust

Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit keys & 4-bit values, reducing memory by 59% with <1% quality loss. Includes benchmarking, visualization, and one-command setup. Optimized for M1/M2/M3 Macs with Metal support.

  • Updated May 21, 2025
  • Python
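The savings the KVSplit description claims can be checked with simple arithmetic. The sketch below is illustrative only (the model shape is a hypothetical 7B-class configuration, not taken from the project); it compares an FP16 KV cache against the mixed 8-bit-key / 4-bit-value scheme. It ignores per-block quantization overhead, which is why it lands near, but not exactly on, the quoted 59% reduction.

```python
# Illustrative KV-cache memory arithmetic (not the KVSplit implementation).
def kv_cache_bytes(n_layers, n_ctx, n_kv_heads, head_dim, key_bits, value_bits):
    """Bytes to store the K and V caches for a full context window."""
    elems = n_layers * n_ctx * n_kv_heads * head_dim  # elements per tensor (K or V)
    return elems * key_bits // 8 + elems * value_bits // 8

# Hypothetical 7B-class shape: 32 layers, 32 KV heads, head_dim 128, 4k context.
fp16  = kv_cache_bytes(32, 4096, 32, 128, 16, 16)
mixed = kv_cache_bytes(32, 4096, 32, 128, 8, 4)

print(fp16 // 2**20, "MiB FP16")    # 2048 MiB
print(mixed // 2**20, "MiB mixed")  # 768 MiB
print(f"reduction: {1 - mixed / fp16:.1%}")  # 62.5% before quantization overhead
```

The gap between this idealized 62.5% and the reported 59% is consistent with the scale/metadata bytes that real quantized blocks carry per group of values.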

This repo is to showcase how you can run a model locally and offline, free of OpenAI dependencies.

  • Updated Jul 12, 2024
  • Python

Build and run AI agents using Docker Compose. A collection of ready-to-use examples for orchestrating open-source LLMs, tools, and agent runtimes.

  • Updated Jul 18, 2025
  • TypeScript

Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

  • Updated Jul 18, 2025
  • Go
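The kind of estimate such a GGUF-inspection tool produces can be sketched in a few lines. This is a hedged first-order model, not the tool's actual formula: resident memory is roughly the quantized weight bytes plus the KV cache plus a fixed compute buffer; the function name and the 0.5 GiB overhead default are assumptions for illustration.

```python
# First-order memory estimate for running a quantized GGUF model (illustrative).
def estimate_memory_gib(n_params_b, bits_per_weight, kv_cache_gib, overhead_gib=0.5):
    """n_params_b: parameter count in billions; bits_per_weight: effective bits
    after quantization (e.g. ~4.5 for a typical Q4 scheme including metadata)."""
    weight_gib = n_params_b * 1e9 * bits_per_weight / 8 / 2**30
    return weight_gib + kv_cache_gib + overhead_gib

# e.g. a 7B model at ~4.5 effective bits/weight with a 1 GiB KV cache:
print(round(estimate_memory_gib(7, 4.5, 1.0), 1), "GiB")
```

A real checker reads the exact tensor shapes and quantization types from the GGUF header instead of assuming a uniform bits-per-weight figure, but the structure of the estimate is the same.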

Run LLMs locally. A clojure wrapper for llama.cpp.

  • Updated Mar 29, 2025
  • Clojure

Booster: an open accelerator for LLMs. Better inference and debugging for AI hackers.

  • Updated Aug 15, 2024
  • C++
shady.ai

A pure-Rust LLM inference engine (supporting any LLM-based MLLM, such as Spark-TTS), powered by the Candle framework.

  • Updated Jun 15, 2025
  • Rust

Improve this page

Add a description, image, and links to the llama-cpp topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llama-cpp topic, visit your repo's landing page and select "manage topics."

Learn more
