video-understanding
Here are 242 public repositories matching this topic...
Language:All
Sort:Most stars
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
- Updated
Aug 14, 2024 - Python
A curated list of action recognition and related area resources
- Updated
May 13, 2023
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
- Updated
Jan 18, 2025 - Python
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
- Updated
Jul 11, 2024 - Python
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
- Updated
Jun 16, 2025 - Python
An open-source toolbox for action understanding based on PyTorch
- Updated
Apr 8, 2022 - Python
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.
- Updated
Feb 12, 2025 - Python
Code & Models for Temporal Segment Networks (TSN) in ECCV 2016
- Updated
Oct 27, 2020 - Python
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
- Updated
Dec 8, 2023 - Python
SALMONN family: A suite of advanced multi-modal LLMs
- Updated
Jul 8, 2025
awesome grounding: A curated list of research papers in visual grounding
- Updated
Apr 9, 2023
Temporal Segment Networks (TSN) in PyTorch
- Updated
Jun 21, 2019 - Python
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
- Updated
Oct 16, 2024 - Python
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
- Updated
Jul 8, 2025 - Python
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
- Updated
Oct 8, 2024 - Python
temporal action detection with SSN
- Updated
Jun 21, 2019 - Python
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
- Updated
Dec 10, 2024 - Python
[ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
- Updated
Jun 24, 2024 - Python
A lightning-fast, cross-platform AI chat application built with React Native.
- Updated
Jul 8, 2025 - TypeScript
🔥 🔥 🔥 A paper list of some recent Computer Vision(CV) works
- Updated
Jul 11, 2025
Improve this page
Add a description, image, and links to thevideo-understanding topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thevideo-understanding topic, visit your repo's landing page and select "manage topics."