referring-expression-segmentation
Here are 16 public repositories matching this topic...
Language:All
Sort:Most stars
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
- Updated
Jul 18, 2023 - Python
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
- Updated
Oct 21, 2024 - Python
[CVPR 2023 Highlight & IJCV 2026] GRES: Generalized Referring Expression Segmentation
- Updated
Nov 26, 2025 - Python
[ICCV 2023 & TPAMI 2025] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
- Updated
Jan 8, 2026 - Python
A benchmark dataset for GREx: GRES, GREC, and GREG [CVPR 2023 & IJCV 2026]
- Updated
Nov 14, 2025 - Python
Code release for "UniVS: Unified and Universal Video Segmentation with Prompts as Queries" (CVPR2024)
- Updated
Dec 2, 2024 - Python
[CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)
- Updated
Aug 4, 2022 - Python
[CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation
- Updated
Jun 23, 2025 - Jupyter Notebook
A lightweight codebase for referring expression comprehension and segmentation
- Updated
May 21, 2022 - Python
Official PyTorch implementation of “MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation”
- Updated
Dec 5, 2024 - Python
[NeurIPS 2025] "SaFiRe: Saccade-Fixation Reiteration with Mamba for Referring Image Segmentation"https://arxiv.org/pdf/2510.10160
- Updated
Nov 26, 2025 - Python
[NeurIPS 2025] "SaFiRe: Saccade-Fixation Reiteration with Mamba for Referring Image Segmentation"https://arxiv.org/pdf/2510.10160
- Updated
Nov 27, 2025 - Python
[MULA Workshop @ CVPR 2022] Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters
- Updated
Jun 28, 2022 - Jupyter Notebook
VIPA: Visual Informative Part Attention Framework for Transformer-based Referring Image Segmentation
- Updated
Mar 28, 2025
PyTorch implementation of Google's PaliGemma vision-language model with VQ-VAE decoder for processing referring expression segmentation outputs. Supports detection, segmentation, VQA, and captioning.
- Updated
Nov 13, 2025 - Python
🌟 Build a PyTorch implementation of Google's PaliGemma model for advanced vision-language tasks, including object detection and segmentation.
- Updated
Feb 20, 2026 - Python
Improve this page
Add a description, image, and links to thereferring-expression-segmentation topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thereferring-expression-segmentation topic, visit your repo's landing page and select "manage topics."