gpt-vision
Here are 26 public repositories matching this topic...
Language:All
Sort:Most stars
Java client library for OpenAI API.Full support for all OpenAI API models including Completions, Chat, Edits, Embeddings, Audio, Files, Assistants-v2, Images, Moderations, Batch, and Fine-tuning.
- Updated
Feb 25, 2025 - Java
Your personal voice assistant based on OpenAI ChatGPT.
- Updated
Mar 13, 2025 - Kotlin
A completely private, locally-operated Ai Assistant/Chatbot/Sub-Agent Framework with realistic Long Term Memory and thought formation using Open Source LLMs. Qdrant is used for the Vector DB.
- Updated
Sep 6, 2024 - Python
MinimalChat is a lightweight, open-source chat application that allows you to interact with various large language models.
- Updated
Feb 28, 2025 - Vue
集成 GPT 问答、Midjourney 绘画等一站式服务的系统
- Updated
Apr 3, 2024 - Vue
A simple matrix bot that supports image generation and chatting using ChatGPT
- Updated
Mar 6, 2025 - Python
Web version of SpeakGPT created using ReactJS and Google Material Design 3.
- Updated
Oct 5, 2024 - JavaScript
Convert PDF to Markdown via OpenAI multi-modal text/vision model.
- Updated
Dec 30, 2024 - Python
autoPDFtagger is a Python tool designed for efficient home-office organization, focusing on digitizing and organizing both digital and paper-based documents. By automating the tagging of PDF files, including image-rich documents and scans of varying quality, it aims to streamline the organization of digital archives.
- Updated
Jan 1, 2024 - Python
A web-based tool that utilizes GPT-4's vision capabilities to analyze and describe system architecture diagrams, providing instant insights and detailed breakdowns in an interactive chat interface.
- Updated
Nov 9, 2023 - JavaScript
Kani extension for supporting vision-language models (VLMs). Comes with model-agnostic support for GPT-Vision and LLaVA.
- Updated
Nov 22, 2023 - Python
Create AWS infrastructure using architecture diagrams and natural language interpreted using the OpenAI GPT model.
- Updated
Nov 18, 2023 - Python
A powerful AI package (built using typescript), inspired by @rizzlogy/bardie, for interacting with the Google Bard API - without needing to set your own cookie!
- Updated
Jan 1, 2024 - TypeScript
- Updated
Dec 3, 2023 - Python
Create interactive polls directly from the whiteboard content. Built on top of tldraw make-real template and live audio-video by 100ms, it uses OpenAI's GPT Vision to create an appropriate question with options to launch a poll instantly that helps engage the audience.
- Updated
Mar 14, 2024 - TypeScript
Auto caption images for training in Stable Diffusion
- Updated
Apr 9, 2024 - Python
AI Powered Invoice Processing! Capture data effectively through contextual OCR and then ask your AI assistant about your own past purchases.
- Updated
Feb 27, 2025 - Python
Improve this page
Add a description, image, and links to thegpt-vision topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thegpt-vision topic, visit your repo's landing page and select "manage topics."