Pinned
- Recap-DataComp-1B Public
[ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"
- MedTrinity-25M Public
[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine"
- story-adapter Public
A Training-free Iterative Framework for Long Story Visualization
- VLAA-Thinking Public
[TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
- OpenVision Public
[ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning
Repositories
- UCSC-VLAA.github.io Public
- OpenVision Public
[ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning
- EarthWhere Public
- MedVLSynther Public
MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs
- VLAA-Thinking Public
[TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
- MedTrinity-25M Public
[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine"
People
This organization has no public members.