self-hosted-ai
Here are 23 public repositories matching this topic...
Perplexica is an AI-powered answering engine.
- Updated Feb 13, 2026 - TypeScript
High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.
- Updated Feb 20, 2026 - Go
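The automatic-failover idea behind this proxy can be sketched in a few lines: try backends in priority order and fall through to the next one when a health probe fails. The backend URLs and the probe function below are illustrative stand-ins, not taken from the project.

```python
# Minimal failover-routing sketch: return the first backend whose
# health probe succeeds, in priority order.
def pick_backend(backends, is_healthy):
    """Return the first healthy backend, or raise if none respond."""
    for b in backends:
        if is_healthy(b):
            return b
    raise RuntimeError("no healthy inference backend available")

# Example: the local Ollama instance is down, so traffic fails over
# to a remote vLLM endpoint (both URLs are hypothetical).
status = {"http://localhost:11434": False, "http://gpu-box:8000": True}
chosen = pick_backend(list(status), status.get)
print(chosen)  # http://gpu-box:8000
```

A real proxy would probe asynchronously and cache results, but the routing decision reduces to this ordered scan.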
Local first speech AI engine for transcription, TTS, and voice workflows.
- Updated Feb 20, 2026 - Rust
Small Language Model Inference, Fine-Tuning and Observability. No GPU, no labeled data needed.
- Updated Feb 3, 2026 - Python
Free, open-source alternative to Weavy AI, Krea Nodes, Freepik Spaces & FloraFauna AI — node-based AI workflow builder for generative image & video pipelines
- Updated Feb 19, 2026 - JavaScript
Emotional AI companions for personal relationships.
- Updated Jan 21, 2026 - Python
Run IBM Granite 4.0 locally on a Raspberry Pi 5 with Ollama. This is privacy-first AI: your data never leaves your device because it runs 100% locally, with no cloud uploads and no third-party tracking.
- Updated Dec 30, 2025 - Shell
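Running a model through Ollama means talking to its local HTTP server (port 11434 by default) via the `/api/generate` endpoint. The sketch below builds such a request; the model tag `granite4` is an assumption, so check `ollama list` for the tag actually pulled on your device.

```python
import json

# Build a request for a locally running Ollama server. Everything stays
# on-device: the request never leaves localhost.
payload = {
    "model": "granite4",  # assumed tag; substitute your pulled model
    "prompt": "Summarize why local inference preserves privacy.",
    "stream": False,  # return one JSON object instead of a token stream
}
body = json.dumps(payload)

# Uncomment to send it against a running Ollama instance:
# import urllib.request
# req = urllib.request.Request(
#     "http://localhost:11434/api/generate",
#     data=body.encode(), headers={"Content-Type": "application/json"})
# print(json.loads(urllib.request.urlopen(req).read())["response"])
print(body)
```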
Recallium is a local, self-hosted universal AI memory system providing a persistent knowledge layer for developer tools (Copilot, Cursor, Claude Desktop). It eliminates "AI amnesia" by automatically capturing, clustering, and surfacing decisions and patterns across all projects. It uses MCP for universal compatibility and keeps your data private.
- Updated Feb 11, 2026 - Batchfile
A private, local RAG (Retrieval-Augmented Generation) system using Flowise, Ollama, and open-source LLMs to chat with your documents securely and offline.
- Updated Nov 21, 2024
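The retrieval step at the heart of any such RAG system is a similarity ranking: embed the document chunks, embed the query, and keep the closest match for the prompt. The toy vectors below stand in for real embeddings (which Ollama or Flowise would supply).

```python
import math

def cosine(a, b):
    """Cosine similarity between two dense vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Pretend embeddings of two document chunks (illustrative values only).
chunks = {
    "invoice totals are in section 4": [0.9, 0.1, 0.0],
    "the warranty lasts two years":    [0.1, 0.8, 0.2],
}
query_vec = [0.85, 0.15, 0.05]  # pretend embedding of "where are the totals?"

# The top-ranked chunk is what gets stuffed into the LLM's context.
best = max(chunks, key=lambda c: cosine(chunks[c], query_vec))
print(best)
```

Production systems index millions of vectors with an ANN store instead of this linear scan, but the ranking criterion is the same.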
AI SMS Auto-Responder for Android. Turn your Android device into an autonomous AI communication hub. A Python-based SMS auto-responder running natively on Termux, powered by LLMs (OpenRouter/Ollama) with a sleek Web & Terminal UI.
- Updated Feb 15, 2026 - Python
Self-hosted AI chat interface with RAG, long-term memory, and admin controls. Works with TabbyAPI, Ollama, vLLM, and any OpenAI-compatible API.
- Updated Feb 17, 2026 - Python
Production-ready guide for connecting OpenClaw to a Telegram Bot. Build a self-hosted Telegram AI Agent using OpenClaw Gateway, pairing, and streaming responses.
- Updated Feb 18, 2026
Powers the local RAG pipeline in the BrainDrive Chat w/ Docs plugin.
- Updated Nov 16, 2025 - Python
Production-ready test-time compute optimization framework for LLM inference. Implements Best-of-N, Sequential Revision, and Beam Search strategies. Validated with models up to 7B parameters.
- Updated Jan 27, 2026 - Python
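Best-of-N, the simplest of the strategies named above, can be shown in miniature: draw N candidate answers, score each with a verifier, and keep the highest-scoring one. The candidate list and the scorer below are stubs standing in for real model sampling and a real reward model.

```python
def best_of_n(candidates, score):
    """Pick the candidate the verifier scores highest."""
    return max(candidates, key=score)

# Stub candidates in place of N samples from an LLM.
candidates = ["2 + 2 = 5", "2 + 2 = 4", "2 + 2 = 22"]

# Stub verifier: reward the answer whose arithmetic checks out.
score = lambda ans: 1.0 if ans.endswith("= 4") else 0.0

winner = best_of_n(candidates, score)
print(winner)  # 2 + 2 = 4
```

Sequential revision and beam search refine this same loop: instead of scoring finished answers once, they score and prune partial continuations as generation proceeds.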
🚀 7 Ways to Run Any LLM Locally - Simple Methods
- Updated Jan 25, 2026
Deployment of a self-hosted LLM infrastructure using Ollama and Open WebUI on Linux, including custom model creation, API integration, and system-level troubleshooting.
- Updated Feb 14, 2026
Kernel-first, lightweight OSS for self-hosted multi-turn AI chat in Docker, with transparent runtime logic, baseline performance data, and split-licensed extensibility.
- Updated Feb 19, 2026 - TypeScript
LocalPrompt is an AI-powered tool designed to refine and optimize AI prompts, helping users run locally hosted AI models like Mistral-7B for privacy and efficiency. Ideal for developers seeking to run LLMs locally without external APIs.
- Updated Feb 10, 2025 - Python
Web-Based Q&A Tool that enables users to extract and query website content using FastAPI, FAISS, and a local TinyLlama-1.1B model, without external APIs. Built with React, it offers a minimal UI for seamless AI-driven search.
- Updated Mar 20, 2025 - Python
🌳 Open-source RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval - Complete open-source implementation with 100% local LLMs (Granite Code 8B + mxbai-embed-large)
- Updated Oct 18, 2025 - Python
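RAPTOR's tree construction can be sketched with stubs: recursively group chunks and summarize each group, so upper layers give progressively coarser views of the corpus. The group size and the concatenating summarizer below are illustrative only; the real pipeline calls a local LLM for each summary.

```python
def build_tree(chunks, summarize, group=2):
    """Build retrieval layers bottom-up until a single root summary remains."""
    layers = [chunks]
    while len(layers[-1]) > 1:
        prev = layers[-1]
        # Summarize consecutive groups of `group` nodes into one parent node.
        nxt = [summarize(prev[i:i + group]) for i in range(0, len(prev), group)]
        layers.append(nxt)
    return layers

# Stub summarizer: a real pipeline would prompt a local model here.
summarize = lambda texts: " / ".join(texts)

tree = build_tree(["a", "b", "c", "d"], summarize)
print(tree[-1][0])  # root node covering every leaf
```

At query time, retrieval can then draw from any layer, matching broad questions to high-level summaries and specific ones to raw leaves.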