[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
This is the official implementation of the paper Universal Instance Perception as Object Discovery and Retrieval.
- 🏆 We are the runner-up in the Segmentation in the Wild challenge.
- 🏆 We are the winner of the BDD100K MOT Challenge and the runner-up of the BDD MOTS Challenge at the CVPR2023 workshop.
- UNINEXT is accepted by CVPR2023.
- UNINEXT reformulates diverse instance perception tasks into a unified object discovery and retrieval paradigm and can flexibly perceive different types of objects by simply changing the input prompts.
- UNINEXT achieves superior performance on 20 challenging benchmarks using a single model with the same model parameters.
Object-centric understanding is one of the most essential and challenging problems in computer vision. In this work, we mainly discuss 10 sub-tasks, distributed on the vertices of the cube shown in the above figure. Since all these tasks aim to perceive instances of certain properties, UNINEXT reorganizes them into three types according to the different input prompts:
- Category Names
- Object Detection
- Instance Segmentation
- Multiple Object Tracking (MOT)
- Multi-Object Tracking and Segmentation (MOTS)
- Video Instance Segmentation (VIS)
- Language Expressions
- Referring Expression Comprehension (REC)
- Referring Expression Segmentation (RES)
- Referring Video Object Segmentation (R-VOS)
- Target Annotations
- Single Object Tracking (SOT)
- Video Object Segmentation (VOS)
Then we propose a unified prompt-guided object discovery and retrieval formulation to solve all the above tasks. Extensive experiments demonstrate that UNINEXT achieves superior performance on 20 challenging benchmarks.
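To give a rough feel for this formulation, here is a minimal sketch (not the actual UNINEXT code; the function name, embedding shapes, and score thresholding are all assumptions for illustration). Object discovery is viewed as producing N instance embeddings, and retrieval as keeping the instances whose similarity to the prompt embedding, whatever the prompt type, is high enough:

```python
import torch
import torch.nn.functional as F

def retrieve_instances(instance_embs: torch.Tensor,
                       prompt_emb: torch.Tensor,
                       threshold: float = 0.5) -> torch.Tensor:
    # instance_embs: (N, D) embeddings of N discovered instance proposals.
    # prompt_emb:    (D,) embedding of the prompt (category name, language
    #                expression, or target annotation) produced by some
    #                task-specific prompt encoder (assumed, not shown here).
    # L2-normalize so the dot product is a cosine similarity in [-1, 1].
    instance_embs = F.normalize(instance_embs, dim=-1)
    prompt_emb = F.normalize(prompt_emb, dim=-1)
    scores = instance_embs @ prompt_emb          # (N,) matching scores
    # Keep the indices of instances that match the prompt.
    return (scores > threshold).nonzero(as_tuple=True)[0]

# Toy usage: 100 proposals and one prompt in a shared 256-d space.
proposals = torch.randn(100, 256)
prompt = torch.randn(256)
kept = retrieve_instances(proposals, prompt)
print(f"retrieved {kept.numel()} of {proposals.size(0)} proposals")
```

The point of the sketch is that the discovery side is shared across all ten tasks, and only the prompt embedding changes per task.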
Demo video: UNINEXT_DEMO_VID_9M.mp4
UNINEXT can flexibly perceive various types of objects by simply changing the input prompts, such as category names, language expressions, and target annotations. We also provide a simple demo script, which supports 4 image-level tasks (object detection, instance segmentation, REC, RES), as sketched below.
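To make the prompt switching concrete, the hypothetical snippet below (illustrative only; it does not reflect the demo script's actual interface) shows how the four demo tasks differ only in the prompt handed to the same model:

```python
# Hypothetical prompts per demo task; the model is unchanged across tasks,
# only the prompt differs. All names here are illustrative assumptions.
demo_prompts = {
    "object_detection":      ["person", "car", "dog"],      # category names
    "instance_segmentation": ["person", "car", "dog"],      # category names
    "REC":                   "the man in the red jacket",   # language expression
    "RES":                   "the man in the red jacket",   # language expression
}
```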
- Installation: Please refer to INSTALL.md for more details.
- Data preparation: Please refer to DATA.md for more details.
- Training: Please refer to TRAIN.md for more details.
- Testing: Please refer to TEST.md for more details.
- Model zoo: Please refer to MODEL_ZOO.md for more details.
If you find UNINEXT useful in your research, please consider citing:
```bibtex
@inproceedings{UNINEXT,
  title={Universal Instance Perception as Object Discovery and Retrieval},
  author={Yan, Bin and Jiang, Yi and Wu, Jiannan and Wang, Dong and Yuan, Zehuan and Luo, Ping and Lu, Huchuan},
  booktitle={CVPR},
  year={2023}
}
```
- Thanks Unicorn for providing experience in unifying four object tracking tasks (SOT, MOT, VOS, MOTS).
- Thanks VNext for providing experience with Video Instance Segmentation (VIS).
- Thanks ReferFormer for providing experience with REC, RES, and R-VOS.
- Thanks GLIP for the idea of unifying object detection and phrase grounding.
- Thanks Detic for the implementation of multi-dataset training.
- Thanks detrex for the implementation of the denoising mechanism.