Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

mbzuai-oryx

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
@mbzuai-oryx

ORYX

A Library for Large Vision-Language Models

Popular repositoriesLoading

  1. Video-ChatGPTVideo-ChatGPTPublic

    [ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

    Python 1.3k 111

  2. Awesome-LLM-Post-trainingAwesome-LLM-Post-trainingPublic

    Awesome Reasoning LLM Tutorial/Survey/Guide

    Python 1.1k 72

  3. groundingLMMgroundingLMMPublic

    [CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

    Python 854 45

  4. LLaVA-ppLLaVA-ppPublic

    🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

    Python 835 62

  5. MobiLlamaMobiLlamaPublic

    MobiLlama : Small Language Model tailored for edge devices

    Python 628 48

  6. GeoChatGeoChatPublic

    [CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing

    Python 535 46

Repositories

Loading
Type
Select type
Language
Select language
Sort
Select order
Showing 10 of 28 repositories
  • LLMVoX Public

    LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

    mbzuai-oryx/LLMVoX’s past year of commit activity
    Python 207 23 2 0 UpdatedMar 20, 2025
  • KITAB-Bench Public

    A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding

    mbzuai-oryx/KITAB-Bench’s past year of commit activity
    Python 31MIT0 0 0 UpdatedMar 20, 2025
  • mbzuai-oryx/TrackingMeetsLMM’s past year of commit activity
    Python 5 1 0 0 UpdatedMar 17, 2025
  • Awesome-LLM-Post-training Public

    Awesome Reasoning LLM Tutorial/Survey/Guide

    mbzuai-oryx/Awesome-LLM-Post-training’s past year of commit activity
    Python 1,118 72 0 0 UpdatedMar 17, 2025
  • TimeTravel Public

    Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts

    mbzuai-oryx/TimeTravel’s past year of commit activity
    Python 17MIT0 0 0 UpdatedMar 16, 2025
  • DriveLMM-o1 Public

    Reasoning DriveLMM

    mbzuai-oryx/DriveLMM-o1’s past year of commit activity
    Python 20 0 0 UpdatedMar 15, 2025
  • AIN Public

    AIN - The First Arabic Inclusive Large Multimodal Model. It is a versatile bilingual LMM excelling in visual and contextual understanding across diverse domains.

    mbzuai-oryx/AIN’s past year of commit activity
    HTML 35MIT 1 0 0 UpdatedMar 13, 2025
  • GeoPixel Public

    GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image analysis, offering advanced multi-target pixel grounding capabilities.

    mbzuai-oryx/GeoPixel’s past year of commit activity
    Python 70Apache-2.0 2 3 0 UpdatedMar 12, 2025
  • CoVR-VidLLM-CVPRW25 Public

    Composed Video Retrieval Challenge CVPR Workshop 2025

    mbzuai-oryx/CoVR-VidLLM-CVPRW25’s past year of commit activity
    Python 2 1 0 0 UpdatedMar 9, 2025
  • VideoGLaMM Public

    [CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

    mbzuai-oryx/VideoGLaMM’s past year of commit activity
    Python 50 1 4 0 UpdatedMar 3, 2025

Top languages

Loading…

Most used topics

Loading…


[8]ページ先頭

©2009-2025 Movatter.jp