Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

visual-language-learning

Here are 14 public repositories matching this topic...

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

  • UpdatedAug 12, 2024
  • Python

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

  • UpdatedMay 13, 2025
  • Python
Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

  • UpdatedMar 5, 2024
  • Python

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

  • UpdatedSep 11, 2024
  • Python

(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

  • UpdatedApr 14, 2024
  • Python

🧘🏻‍♂️KarmaVLM (相生):A family of high efficiency and powerful visual language model.

  • UpdatedApr 29, 2024
  • Python

Build a simple basic multimodal large model from scratch. 从零搭建一个简单的基础多模态大模型🤖

  • UpdatedJun 19, 2024
  • Python

[ACM MMGR '24] 🔍 Shotluck Holmes: A family of small-scale LLVMs for shot-level video understanding

  • UpdatedOct 26, 2024
  • Python

PyTorch implementation of OpenAI's CLIP model for image classification, visual search, and visual question answering (VQA).

  • UpdatedSep 14, 2024
  • Jupyter Notebook

Efficient Video Question Answering

  • UpdatedJan 19, 2023
  • Python

Improve this page

Add a description, image, and links to thevisual-language-learning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thevisual-language-learning topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp