Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

vlms

Here are 90 public repositories matching this topic...

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

  • UpdatedFeb 20, 2026
  • Python

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

  • UpdatedFeb 6, 2026

🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality degradation across Weight-Only Quantization, MXFP4, NVFP4, GGUF, and adaptive schemes.

  • UpdatedFeb 14, 2026
  • Python

Official repository for VisionZip (CVPR 2025)

  • UpdatedJul 21, 2025
  • Python

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

  • UpdatedOct 14, 2025
  • Python

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

  • UpdatedJun 9, 2024
  • Python

This repository collects research papers of large Foundation Models for Scenario Generation and Analysis in Autonomous Driving. The repository will be continuously updated to track the latest update.

  • UpdatedFeb 6, 2026

Open-source tools for training and evaluating Vision Language Models for OCR

  • UpdatedJan 28, 2026
  • Python

[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation

  • UpdatedOct 5, 2024
  • Python

[CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

  • UpdatedAug 8, 2025
  • Python

[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding

  • UpdatedMay 24, 2025
  • Python

[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment

  • UpdatedSep 26, 2024
  • Python

Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments

  • UpdatedFeb 14, 2025
  • Python

Implementing scalable LLMs in pure JAX (no third-party libraries)

  • UpdatedFeb 19, 2026
  • Python

Code for our ICCV 2025 paper "CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers."

  • UpdatedOct 30, 2025
  • Python

Improve this page

Add a description, image, and links to thevlms topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thevlms topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2026 Movatter.jp