Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection

License

NotificationsYou must be signed in to change notification settings

wusize/ovdet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Introduction

This is an official release of the paperAligning Bag of Regions for Open-Vocabulary Object Detection.

Aligning Bag of Regions for Open-Vocabulary Object Detection,
Size Wu, Wenwei Zhang, Sheng Jin, Wentao Liu, Chen Change Loy
In: Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[Paper][Supp][project page(TBD)][Bibetex]

Installation

This project is based onMMDetection 3.x

It requires the following OpenMMLab packages:

  • MMEngine >= 0.6.0
  • MMCV-full >= v2.0.0rc4
  • MMDetection >= v3.0.0rc6
  • lvisapi
pip install openmim mmenginemim install"mmcv>=2.0.0rc4"pip install git+https://github.com/lvis-dataset/lvis-api.gitmim install mmdet>=3.0.0rc6

License

This project is released under theNTU S-Lab License 1.0.

Usage

Obtain CLIP Checkpoints

We use CLIP's ViT-B-32 model for the implementation of our method. Obtain the state_dictof the model fromGoogleDrive andput it undercheckpoints. Otherwise,pip install git+https://github.com/openai/CLIP.git andrun

importclipimporttorchmodel,_=clip.load("ViT-B/32")torch.save(model.state_dict(),'checkpoints/clip_vitb32.pth')

Training and Testing

The training and testing onOV-COCO are supported now.

Citation

@inproceedings{wu2023baron,title={Aligning Bag of Regions for Open-Vocabulary Object Detection},author={Size Wu and Wenwei Zhang and Sheng Jin and Wentao Liu and Chen Change Loy},year={2023},booktitle={CVPR},}

[8]ページ先頭

©2009-2025 Movatter.jp