Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
#

multi-modal

Here are 363 public repositories matching this topic...

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

  • UpdatedMar 3, 2025
  • Python

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow.https://activeloop.ai

  • UpdatedApr 23, 2025
  • Python

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

  • UpdatedApr 27, 2025
  • Python
modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

  • UpdatedApr 30, 2025
  • Python

Start building LLM-empowered multi-agent applications in an easier way.

  • UpdatedApr 30, 2025
  • Python

a state-of-the-art-level open visual language model | 多模态预训练模型

  • UpdatedMay 29, 2024
  • Python

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

  • UpdatedFeb 17, 2024
  • Python
marqo

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

  • UpdatedAug 23, 2024
  • Python

OmniGen: Unified Image Generation.https://arxiv.org/pdf/2409.11340

  • UpdatedFeb 20, 2025
  • Jupyter Notebook

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

  • UpdatedDec 3, 2024
  • Python

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

  • UpdatedApr 29, 2025
  • C#
docarray

GPT4V-level open-source multi-modal model based on Llama3-8B

  • UpdatedMar 3, 2025
  • Python

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

  • UpdatedApr 30, 2025
  • Python

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

  • UpdatedFeb 16, 2025
  • Python

Improve this page

Add a description, image, and links to themulti-modal topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with themulti-modal topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp