Welcome to the Vision Lab @ HUST!

🙋‍♀️ Introduction

Hello! This is the GitHub space for theVision Lab led byProfessorXinggang Wang. We are based at theArtificial Intelligence Institute, School of Electronic Information and Communications, Huazhong University of Science and Technology (HUST).

Our research focuses oncomputer vision and deep learning. We are particularly interested in:

Multimodal Foundation Models
Visual Representation Learning
Object Detection, Segmentation, and Tracking
End-to-end Autonomous Driving
Novel Neural Architectures

Our group strives to push the boundaries of visual intelligence and has produced highly influential works in the field, includingCCNet,Mask Scoring R-CNN,FairMOT,ByteTrack,EVA,MapTR,Vectorized Autonomous Driving (VAD),DiffusionDrive,Vision Mamba (Vim),4D Gaussian Splatting (4DGS),YOLOS,YOLO-World, andLightningDiT & VA-VAE.

🌈 Contribution Guidelines & Collaboration

We actively contribute to the research community through publications and open-source projects.

Research Collaboration: We are open to collaborations in our areas of interest. Please feel free to reach out to Prof. Xinggang Wang (xgwang # hust.edu.cn).
Prospective Students: Our group has a strong track record of mentoring Ph.D. and Master's students who lead impactful publications. Interested students can find more information on Prof. Wang's faculty page.
Using Our Code: You are welcome to explore and use the code in our repositories. Please ensure you cite the corresponding publications appropriately. Specific details can usually be found in the README files of individual repositories.
Contributing to Projects: For guidelines on contributing to specific projects (e.g., bug reports, pull requests), please check the individual repositories.

👩‍💻 Useful Resources

Prof. Wang's HUST Faculty Webpage:http://faculty.hust.edu.cn/xwang - Find Prof. Wang's official profile and contact information.
Personal Homepage (also this GitHub Pages site):https://xwcv.github.io - Contains information on research, publications, and more.
Google Scholar Profile:https://scholar.google.com/citations?hl=en&user=qNCTLV0AAAAJ&view_op=list_works - View a comprehensive list of publications and citations.
Repositories: Explore the repositories within this GitHub organization for code and datasets related to our research publications.

PinnedLoading

VimVimPublic
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Python 3.8k 274
LightningDiTLightningDiTPublic
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
Python 1.4k 53
4DGaussians4DGaussiansPublic
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
Jupyter Notebook 3.4k 325
VADVADPublic
[ICCV 2023 & ICLR 2026] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
Python 1.2k 141
MapTRMapTRPublic
[ICLR'23 Spotlight & ECCV'24 & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction
Python 1.5k 236
DiffusionDriveDiffusionDrivePublic
[CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving
Python 1.3k 121

Repositories

Showing 10 of 119 repositories

InfiniteVL Public
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models
hustvl/InfiniteVL’s past year of commit activity
Python 84Apache-2.0 4 1 0 UpdatedFeb 2, 2026
VAD Public
[ICCV 2023 & ICLR 2026] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
hustvl/VAD’s past year of commit activity
Python 1,229Apache-2.0 142 76 1 UpdatedJan 31, 2026
VGT Public
Visual Generation Tuning
hustvl/VGT’s past year of commit activity
Python 97MIT0 1 0 UpdatedJan 27, 2026
MobileI2V Public
[ArXiv 2025] MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
hustvl/MobileI2V’s past year of commit activity
Python 68 2 1 0 UpdatedJan 5, 2026
GaussTR Public
[CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
hustvl/GaussTR’s past year of commit activity
Python 207MIT 11 1 0 UpdatedJan 5, 2026
DiffusionDriveV2 Public
DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving
hustvl/DiffusionDriveV2’s past year of commit activity
Python 237MIT 21 10 2 UpdatedDec 29, 2025
SuperCLIP Public
hustvl/SuperCLIP’s past year of commit activity
Python 122Apache-2.0 6 3 0 UpdatedDec 26, 2025
DiffusionVL Public
[ArXiv 2025] DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models
hustvl/DiffusionVL’s past year of commit activity
Python 131Apache-2.0 5 3 0 UpdatedDec 25, 2025
TBCM Public
Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs
hustvl/TBCM’s past year of commit activity
Python 210 1 0 UpdatedDec 16, 2025
LightningDiT Public
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
hustvl/LightningDiT’s past year of commit activity
Python 1,397MIT 53 18 1 UpdatedDec 16, 2025