- Notifications
You must be signed in to change notification settings - Fork17
facebookresearch/FAIR-Play
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
2.5D Visual Sound
Ruohan Gao1 andKristen Grauman2
1UT Austin,2Facebook AI Research
In Conference on Computer Vision and Pattern Recognition (CVPR), 2019
This repository (~100G) contains the FAIR-Play dataset we collected and used in ourCVPR 2019 paper. It contains 1,871 video clips and their corresponding binaural audio clips recorded in a music room. The video clip and binaural clip of the same index are roughly aligned. The splits directory contains the 10 random splits used in the paper. SeePseudoBinaural for 5 more challenging splits, where there are no or less scene overlap in the training and testing splits. The code is shared at2.5D Visual Sound Code.
- The dataset can be downloaded by cloning the repository uisng git lfs:
brew install git-lfsgit lfs clone git@github.com:facebookresearch/FAIR-Play.gitgit lfs installgit lfs pull- If you have trouble in downloading the dataset through GitHub, you can also download it using the following commands:
wget http://dl.fbaipublicfiles.com/FAIR-Play/videos.tar.gzwget http://dl.fbaipublicfiles.com/FAIR-Play/audios.tar.gzwget http://dl.fbaipublicfiles.com/FAIR-Play/splits.tar.gz- The dataset is also shared atUT Box.
If you find our data or project useful in your research, please cite:
@inproceedings{gao2019visualsound, title={2.5D Visual Sound}, author={Gao, Ruohan and Grauman, Kristen}, booktitle={CVPR}, year={2019} }We would like to thank Tony Miller, Jacob Donley, Pablo Hoffmann and Vladimir Tourbabin from Facebook for helpful discussions and the volunteers who participate in our data collection.
FAIR-Play is CC BY 4.0 licensed, as found in the LICENSE file.
About
2.5D visual sound dataset
Resources
License
Code of conduct
Contributing
Security policy
Uh oh!
There was an error while loading.Please reload this page.
