isarandi/metrabsPublic

NotificationsYou must be signed in to change notification settings
Fork73
Star520

Estimate absolute 3D human poses from RGB images.

License

MIT license

520 stars 73 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
demos		demos
docs		docs
img		img
metrabs_pytorch		metrabs_pytorch
metrabs_tf		metrabs_tf
LICENSE		LICENSE
README.md		README.md
demo.png		demo.png
environment.yml		environment.yml
install_dependencies.sh		install_dependencies.sh
metrabs_demo.ipynb		metrabs_demo.ipynb
test_image_3dpw.jpg		test_image_3dpw.jpg
thumbnail_video.png		thumbnail_video.png

Repository files navigation

MeTRAbs Absolute 3D Human Pose Estimator

This repository contains code for the following paper:

MeTRAbs: Metric-Scale Truncation-Robust Heatmaps for Absolute 3D Human Pose Estimation
by István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe
IEEE Transactions on Biometrics, Behavior, and Identity Science (T-BIOM), Selected Best Works FromAutomated Face and Gesture Recognition 2020.

The repo has been updated to an improved version employed in the following paper:

Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton Formats
by István Sárándi, Alexander Hermans, Bastian Leibe
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023.

News

[2023-08-02] Major codebase refactoring, models as described in ourWACV'23 paper, several components factored out into separate repos, PyTorch support for inference, and more.
[2021-12-03] Added new backbones, including the ResNet family from ResNet-18 to ResNet-152
[2021-10-19] Released new best-performingmodels based on EfficientNetV2 and super fastones using MobileNetV3, simplifiedAPI, multiple skeleton conventions, support forradial/tangential distortion, improved antialiasing, plausibility filtering and other newfeatures.
[2021-10-19] Full codebase migrated to TensorFlow 2 and Keras
[2020-11-19] Oral presentation at the IEEE Conference on Automatic Face and Gesture Recognition(FG'20) (Talk VideoandSlides)
[2020-11-16] Training and evaluation code now released along with dataset pre-processing scripts!Code and models upgraded to Tensorflow 2.
[2020-10-06]Journal paper accepted for publication in theIEEE Transactions on Biometrics, Behavior, and Identity Science (T-BIOM), Best of FG Special Issue
[2020-08-23] Short presentation at ECCV2020's 3DPWworkshop (slides)
[2020-08-06] Our method has wonthe3DPW Challenge

Inference Code

We releasestandalone TensorFlow models (SavedModel) to allow easy application in downstreamresearch. After loading the model, you can run inference in a single line of Pythonwithout havingthis codebase as a dependency. Try it in action inGoogle Colab.

Gist of Usage

importtensorflowastfimporttensorflow_hubastfhubmodel=tfhub.load('https://bit.ly/metrabs_l')image=tf.image.decode_jpeg(tf.io.read_file('img/test_image_3dpw.jpg'))pred=model.detect_poses(image)pred['boxes'],pred['poses2d'],pred['poses3d']

See also thedemos folder for more examples.

NOTE: The models can only be used fornon-commercial purposes due to the licensing of the usedtraining datasets.

Alternatively, you can try the experimental PyTorch version:

wget -O - https://bit.ly/metrabs_l_pt| tar -xzvf -python -m metrabs_pytorch.scripts.demo_image --model-dir metrabs_eff2l_384px_800k_28ds_pytorch --image img/test_image_3dpw.jpg

Demos

./demo.py to auto-download the model, predict on a sample image and display theresult with Matplotlib orPoseViz (if installed).
./demo_video.py filepath-or-url-to-video.mp4 to run inference on a video.

Documentation

How-to Guide with Examples
Full API Reference

Feature Summary

Several skeleton conventions supported through the keyword argumentskeleton (e.g. COCO,SMPL, H36M)
Multi-image (batched) and single-image predictions both supported
Advanced, parallelized cropping logic behind the scenes
- Anti-aliasing through image pyramid andsupersampling,gamma-correct rescaling.
- GPU-accelerated undistortion of pinhole perspective (homography) and radial/tangential lensdistortions
Estimates returned in3D world space (when calibration is provided) and2D pixel space
Built-in, configurabletest-time augmentation (TTA) with rotation, flip and brightness (keywordargumentnum_aug sets the number of TTA crops per detection)
Automaticsuppression of implausible poses and non-max suppression on the 3D pose level (can be turned off)
Multiple backbones with different speed-accuracy trade-off (EfficientNetV2, MobileNetV3)

Training and Evaluation

See the docs directory.

BibTeX

If you find this work useful in your research, please cite it as:

@article{sarandi2021metrabs,title={{MeTRAbs:} Metric-Scale Truncation-Robust Heatmaps for Absolute 3{D} Human Pose Estimation},author={S\'ar\'andi, Istv\'an and Linder, Timm and Arras, Kai O. and Leibe, Bastian},journal={IEEE Transactions on Biometrics, Behavior, and Identity Science},year={2021},volume={3},number={1},pages={16-30},doi={10.1109/TBIOM.2020.3037257}}

The above paper is an extended journal version of the FG'2020 conference paper:

@inproceedings{Sarandi20FG,title={Metric-Scale Truncation-Robust Heatmaps for 3{D} Human Pose Estimation},author={S\'ar\'andi, Istv\'an and Linder, Timm and Arras, Kai O. and Leibe, Bastian},booktitle={IEEE International Conference on Automatic Face and Gesture Recognition},pages={677-684},year={2020}}

The newer large-scale models correspond to the WACV'23 paper:

@inproceedings{Sarandi2023dozens,author ={S\'ar\'andi, Istv\'an and Hermans, Alexander and Leibe, Bastian},title ={Learning {3D} Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton Formats},booktitle ={IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},year ={2023}}

Contact

Code in this repository was written byIstván Sárándi (RWTH AachenUniversity) unless indicated otherwise.

Got any questions or feedback? Drop a mail tosarandi@vision.rwth-aachen.de!

About

Estimate absolute 3D human poses from RGB images.

arxiv.org/abs/2007.07227

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

MeTRAbs Absolute 3D Human Pose Estimator

News

Inference Code

Gist of Usage

Demos

Documentation

Feature Summary

Training and Evaluation

BibTeX

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages

Movatterモバイル変換

License

isarandi/metrabs

Folders and files

Latest commit

History

Repository files navigation

MeTRAbs Absolute 3D Human Pose Estimator

News

Inference Code

Gist of Usage

Demos

Documentation

Feature Summary

Training and Evaluation

BibTeX

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages