Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
#

multi-modality

Here are 85 public repositories matching this topic...

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

  • UpdatedAug 12, 2024
  • Python
clip-as-serviceswarms

Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created byhttps://twitter.com/advadnoun

  • UpdatedMar 13, 2022
  • Python
Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

  • UpdatedMar 5, 2024
  • Python

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

  • UpdatedApr 21, 2024
  • Python

The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google

  • UpdatedMar 17, 2025
  • Python

[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

  • UpdatedJun 5, 2024
  • Python

Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.

  • UpdatedJun 4, 2024
  • Python

[CVPR 2025] MINIMA: Modality Invariant Image Matching

  • UpdatedMar 14, 2025
  • Python

An official PyTorch implementation of the CRIS paper

  • UpdatedJun 9, 2024
  • Python

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

  • UpdatedSep 11, 2024
  • Python

Official repository for VisionZip (CVPR 2025)

  • UpdatedFeb 27, 2025
  • Python

[ICCV2019] Robust Multi-Modality Multi-Object Tracking

  • UpdatedDec 7, 2019
  • Python

Improve this page

Add a description, image, and links to themulti-modality topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with themulti-modality topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp