hassony2/obman_trainPublic

NotificationsYou must be signed in to change notification settings
Fork26
Star195

[cvpr19] Demo, training and evaluation code for generating dense hand+object reconstructions from single rgb images

License

GPL-3.0 license

195 stars 26 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
assets		assets
handobjectdatasets		handobjectdatasets
mano_train		mano_train
readme_assets/images		readme_assets/images
scripts		scripts
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
image_demo.py		image_demo.py
pyproject.toml		pyproject.toml
traineval.py		traineval.py
webcam_demo.py		webcam_demo.py

Repository files navigation

Learning Joint Reconstruction of Hands and Manipulated Objects - Demo, Training Code and Models

Yana Hasson, Gül Varol, Dimitris Tzionas, Igor Kalevatykh, Michael J. Black, Ivan Laptev, Cordelia Schmid, CVPR 2019

Get the code

git clone https://github.com/hassony2/obman_traincd obman_train

Download and prepare datasets

Download the ObMan dataset

Request the dataset on thedataset page
Create asymlinkln -s path/to/downloaded/obman datasymlinks/obman
Download ShapeNetCore v2 object meshes from theShapeNet official website
Create a symlinkln -s /sequoia/data2/dataset/shapenet/ShapeNetCore.v2 datasymlinks/ShapeNetCore.v2

Your data structure should now look like

obman_train/  datasymlinks/ShapeNetCore.v2  datasymlinks/obman

Download the First-Person Hand Action Benchmark dataset

Follow theofficial instructions to download the dataset

Download model files

Download model files fromherewget http://www.di.ens.fr/willow/research/obman/release_models.zip
unzipunzip release_models.zip

Install python dependencies

create conda environment with dependencies:conda env create -f environment.yml
activate environment:conda activate obman_train

Install the MANO PyTorch layer

Follow the instructions fromhere

Download the MANO model files

Go toMANO website
Create an account by clickingSign Up and provide your information
Download Models and Code (the downloaded file should have the format mano_v*_*.zip). Note that all code and data from this download falls under theMANO license.
unzip and copy the content of themodels folder into the misc/mano folder
Your structure should look like this:

obman_train/  misc/    mano/      MANO_LEFT.pkl      MANO_RIGHT.pkl  release_models/    fhb/    obman/    hands_only/

Launch

Demo

We provide a model trained on the synthetic ObMan dataset

Single image demo

python image_demo.py --resume release_models/obman/checkpoint.pth.tar

In this demo, both the original and flipped inputs are fed, and the outputs are therefore presented for the input treated as a right and a left hand side by side.

Running the demo should produce the following outputs.

You can also run this demo on data from theFirst Hand Action Benchmark

python image_demo.py --image_path readme_assets/images/fhb_liquid_soap.jpeg --resume release_models/fhb/checkpoint.pth.tar

Note that the model trained on First Hand Action Benchmark strongly overfits to this dataset, and therefore performs poorly on 'in the wild' images.

Video demo

You can test it on a recorded video or live using a webcam by launching :

python webcam_demo.py --resume release_models/obman/checkpoint.pth.tar --hand_side left

Hand side detection is not handled in this pipeline, therefore, you should explicitly indicate whether you want to use the right or left hand with--hand_side.

Note that the video demo has some lag time, which comes from the visualization bottleneck (matplotlib image rendering is quite slow).

Limitations

This demo doesn't operate hand detection, so the model expects a roughly centered hand
As we are deforming a sphere, the topology of the object is 0, which explains results such as the following:

the model is trained only on hands holding objects, and therefore doesn't perform well on hands in the absence of objects for poses that do not resemble common grasp poses.
the model is trained on grasping hands only, and therefore struggles with hand poses that are associated with object-handling
- In addition to the models, we also provide a hand-only model trained on various hand datasets, including our ObMan dataset, that captures a wider variety of hand poses
- to try it, launchpython webcam_demo.py --resume release_models/hands_only/checkpoint.pth.tar
- Note that this model also regresses a translation and scale parameter that allows to overlay the predicted 2D joints on the images according to an orthographic projection model

Training

python traineval.py --atlas_predict_trans --atlas_predict_scale --atlas_mesh --mano_use_shape --mano_use_pca --freeze_batchnorm --atlas_separate_encoder

Citations

If you find this code useful for your research, consider citing:

@INPROCEEDINGS{hasson19_obman,  title     = {Learning joint reconstruction of hands and manipulated objects},  author    = {Hasson, Yana and Varol, G{\"u}l and Tzionas, Dimitris and Kalevatykh, Igor and Black, Michael J. and Laptev, Ivan and Schmid, Cordelia},  booktitle = {CVPR},  year      = {2019}}

Acknowledgements

AtlasNet code

Code related toAtlasNet is in large part adapted from the officialAtlasNet repository.ThanksThibault for the provided code !

Hand evaluation code

Code for computing hand evaluation metrics was reused fromhand3d, courtesy ofChristian Zimmermann with an easy-to-use interface!

Laplacian regularization loss

Code for the laplacian regularization and precious advice was provided byAngjoo Kanazawa !

First Hand Action Benchmark dataset

Helpful advice to work with the dataset was provided byGuillermo Garcia-Hernando !

About

[cvpr19] Demo, training and evaluation code for generating dense hand+object reconstructions from single rgb images

hassony2.github.io/obman.html

Releases

No releases published

Packages

No packages published

Languages

Python100.0%

Movatterモバイル変換

License

hassony2/obman_train

Folders and files

Latest commit

History

Repository files navigation

Learning Joint Reconstruction of Hands and Manipulated Objects - Demo, Training Code and Models

Get the code

Download and prepare datasets

Download the ObMan dataset

Download the First-Person Hand Action Benchmark dataset

Download model files

Install python dependencies

Install the MANO PyTorch layer

Download the MANO model files

Launch

Demo

Single image demo

Video demo

Limitations

Training

Citations

Acknowledgements

AtlasNet code

Hand evaluation code

Laplacian regularization loss

First Hand Action Benchmark dataset

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Languages

Packages