Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

multi-modal

Here are 455 public repositories matching this topic...

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

  • UpdatedFeb 7, 2026
  • Python

AgentScope: Agent-Oriented Programming for Building LLM Applications

  • UpdatedFeb 7, 2026
  • Python

Open-source framework for conversational voice AI agents

  • UpdatedFeb 7, 2026
  • Python

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

  • UpdatedSep 22, 2025
  • Python

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow.https://activeloop.ai

  • UpdatedFeb 7, 2026
  • C++
modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

  • UpdatedJan 24, 2026
  • Python
big-AGI

AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. Includes AI personas, AGI functions, world-class Beam multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.

  • UpdatedFeb 7, 2026
  • TypeScript

a state-of-the-art-level open visual language model | 多模态预训练模型

  • UpdatedMay 29, 2024
  • Python

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

  • UpdatedFeb 17, 2024
  • Python
marqo

Ecommerce Search and Discovery - marqo.ai

  • UpdatedFeb 7, 2026
  • Python

OmniGen: Unified Image Generation.https://arxiv.org/pdf/2409.11340

  • UpdatedDec 4, 2025
  • Jupyter Notebook

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

  • UpdatedAug 23, 2024
  • Python

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

  • UpdatedFeb 3, 2026
  • Python

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

  • UpdatedFeb 1, 2026
  • C#

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

  • UpdatedDec 3, 2024
  • Python
docarray

Improve this page

Add a description, image, and links to themulti-modal topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with themulti-modal topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2026 Movatter.jp