Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

[ICCV 2021] MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection

License

NotificationsYou must be signed in to change notification settings

czhang0528/MosaicOS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Mosaic ofObject-centric Images asScene-centric Images (MosaicOS) for long-tailed object detection and instance segmentation.

Introduction

Many objects do not appear frequently enough in complex scenes (e.g., certain handbags in living rooms) fortraining an accurate object detector, but are often found frequently by themselves (e.g., in product images).Yet, these object-centric images are not effectively leveraged for improving object detection in scene-centricimages.

We propose Mosaic of Object-centric images as Scene-centric images (MosaicOS), a simple and novel framework that is surprisingly effective at tackling the challenges of long-tailed object detection. Keys to our approachare three-fold: (i) pseudo scene-centric image construction from object-centric images for mitigating domain differences, (ii) high-quality bounding box imputation usingthe object-centric images’ class labels, and (iii) a multistage training procedure. Check our paper for further details:

MosaicOS:A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection. In IEEE/CVF International Conference on Computer Vision (ICCV), 2021.

byCheng Zhang*,Tai-Yu Pan*,Yandong Li,Hexiang Hu,Dong Xuan,Soravit Changpinyo,Boqing Gong,Wei-Lun Chao.

Mosaics

The scriptmosaic.py generates mosaic images and annotaions by given an annotation file in COCO format (for more informationhere). The following command will generate 2x2 mosaic images and the annotation file for COCO training dataset inOUTPUT_DIR/images/ andOUTPUT_DIR/annotation.json with 4 processors.--shuffle is to shuffle the order of images to synthesize and--drop-last is to drop the last couple of images if they are not enough fornrow * ncol.--demo 10 plots 10 synthesized images with annotated boxes inOUTPUT_DIR/demo/ for visualization.

 python mosaic.py --coco-file datasets/coco/annotations/instances_train2017.json --img-dir datasets/coco --output-dir output_mosaics --num-proc 4 --nrow 2 --ncol 2 --shuffle --drop-last --demo 10

*Note: In our work, we sythesize mosaics from object-centric images with pseudo bounding box to find-tune the pre-trained detector.

Pre-trained models

Our impelementation is based onDetectron2.All models are trained onLVIS training set withRepeated FactorSampling (RFS).

LVIS v0.5 validation set

  • Object detection
BackboneMethodAPbAPbrAPbcAPbfDownload
R50-FPNFaster R-CNN23.413.022.628.4model
R50-FPNMosaicOS25.020.223.928.3model
  • Instance segmentation
BackboneMethodAPAPrAPcAPfAPbDownload
R50-FPNMask R-CNN24.416.024.028.323.6model
R50-FPNMosaicOS26.319.726.628.525.8model

LVIS v1.0 validation set

  • Object detection
BackboneMethodAPbAPbrAPbcAPbfDownload
R50-FPNFaster R-CNN22.010.620.129.2model
R50-FPNMosaicOS23.915.522.429.3model
  • Instance segmentation
BackboneMethodAPAPrAPcAPfAPbDownload
R50-FPNMask R-CNN22.612.321.328.623.3model
R50-FPNMosaicOS24.518.223.028.825.1model
R101-FPNMask R-CNN24.815.223.730.325.5model
R101-FPNMosaicOS26.720.525.830.527.4model
X101-FPNMask R-CNN26.717.625.631.927.4model
X101-FPNMosaicOS28.321.827.232.428.9model

Citation

Please cite with the following bibtex if you find it useful.

@inproceedings{zhang2021mosaicos,  title={{MosaicOS}: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection},  author={Zhang, Cheng and Pan, Tai-Yu and Li, Yandong and Hu, Hexiang and Xuan, Dong and Changpinyo, Soravit and Gong, Boqing and Chao, Wei-Lun},  booktitle = {ICCV},  year={2021}}

Questions

Feel free to email us if you have any questions.

Cheng Zhang (zhang.7804@osu.edu), Tai-Yu Pan (pan.667@osu.edu), Wei-Lun Harry Chao (chao.209@osu.edu)

About

[ICCV 2021] MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages


[8]ページ先頭

©2009-2025 Movatter.jp