VLAA@UCSC

PinnedLoading

Recap-DataComp-1BRecap-DataComp-1BPublic
[ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"
145 1
MedTrinity-25MMedTrinity-25MPublic
[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine“
Python 391 27
story-adapterstory-adapterPublic
A Training-free Iterative Framework for Long Story Visualization
Python 936 131
MedReasonMedReasonPublic
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
Python 245 19
VLAA-ThinkingVLAA-ThinkingPublic
[TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
Python 145 1
OpenVisionOpenVisionPublic
[ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning
Python 409 20

Repositories

Showing 10 of 39 repositories

UCSC-VLAA.github.io Public
UCSC-VLAA/UCSC-VLAA.github.io’s past year of commit activity
JavaScript00 0 0 UpdatedDec 8, 2025
ViLBench Public
[EMNLP'25] Official Python Implementation of ViLBench: A Suite for Vision-Language Process Reward Modeling
UCSC-VLAA/ViLBench’s past year of commit activity
Python 10 0 0 UpdatedDec 8, 2025
OpenVision Public
[ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning
UCSC-VLAA/OpenVision’s past year of commit activity
Python 409Apache-2.0 20 5 0 UpdatedDec 1, 2025
MeDiM Public
UCSC-VLAA/MeDiM’s past year of commit activity
Python 22MIT0 1 0 UpdatedDec 1, 2025
EarthWhere Public
UCSC-VLAA/EarthWhere’s past year of commit activity
Python 140 0 0 UpdatedNov 15, 2025
MedVLThinker Public
[ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning
UCSC-VLAA/MedVLThinker’s past year of commit activity
Jupyter Notebook 44Apache-2.0 2 0 0 UpdatedNov 1, 2025
MedVLSynther Public
MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs
UCSC-VLAA/MedVLSynther’s past year of commit activity
Python 9Apache-2.00 0 0 UpdatedNov 1, 2025
VLAA-Thinking Public
[TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
UCSC-VLAA/VLAA-Thinking’s past year of commit activity
Python 145Apache-2.0 1 3 0 UpdatedOct 10, 2025
MedTrinity-25M Public
[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine“
UCSC-VLAA/MedTrinity-25M’s past year of commit activity
Python 391 27 11 0 UpdatedJul 11, 2025
MedReason Public
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
UCSC-VLAA/MedReason’s past year of commit activity
Python 245 19 1 0 UpdatedJun 19, 2025