Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
This repository was archived by the owner on Jun 13, 2024. It is now read-only.

The project is an official implement of our ECCV2018 paper "Simple Baselines for Human Pose Estimation and Tracking(https://arxiv.org/abs/1804.06208)"

License

NotificationsYou must be signed in to change notification settings

microsoft/human-pose-estimation.pytorch

News

Introduction

This is an official pytorch implementation ofSimple Baselines for Human Pose Estimation and Tracking. This work provides baseline methods that are surprisingly simple and effective, thus helpful for inspiring and evaluating new ideas for the field. State-of-the-art results are achieved on challenging benchmarks. On COCO keypoints valid dataset, our bestsingle model achieves74.3 of mAP. You can reproduce our results using this repo. All models are provided for research purpose.

Main Results

Results on MPII val

ArchHeadShoulderElbowWristHipKneeAnkleMeanMean@0.1
256x256_pose_resnet_50_d256d256d25696.35195.32988.98983.17688.42083.96079.59488.53233.911
384x384_pose_resnet_50_d256d256d25696.65895.75489.79084.61488.52384.66679.28789.06638.046
256x256_pose_resnet_101_d256d256d25696.86295.87389.51884.37688.43784.48680.70389.13134.020
384x384_pose_resnet_101_d256d256d25696.96595.90790.26885.78089.59785.93582.09890.00338.860
256x256_pose_resnet_152_d256d256d25697.03395.94190.04684.97689.16485.31181.27189.62035.025
384x384_pose_resnet_152_d256d256d25696.79495.61890.08086.22589.70086.86282.85390.20039.433

Note:

  • Flip test is used.

Results on COCO val2017 with detector having human AP of 56.4 on COCO val2017 dataset

ArchAPAp .5AP .75AP (M)AP (L)ARAR .5AR .75AR (M)AR (L)
256x192_pose_resnet_50_d256d256d2560.7040.8860.7830.6710.7720.7630.9290.8340.7210.824
384x288_pose_resnet_50_d256d256d2560.7220.8930.7890.6810.7970.7760.9320.8380.7280.846
256x192_pose_resnet_101_d256d256d2560.7140.8930.7930.6810.7810.7710.9340.8400.7300.832
384x288_pose_resnet_101_d256d256d2560.7360.8960.8030.6990.8110.7910.9360.8510.7450.858
256x192_pose_resnet_152_d256d256d2560.7200.8930.7980.6870.7890.7780.9340.8460.7360.839
384x288_pose_resnet_152_d256d256d2560.7430.8960.8110.7050.8160.7970.9370.8580.7510.863

Results onCaffe-style ResNet

ArchAPAp .5AP .75AP (M)AP (L)ARAR .5AR .75AR (M)AR (L)
256x192_pose_resnet_50_caffe_d256d256d2560.7040.9140.7820.6770.7440.7350.9210.8050.7040.783
256x192_pose_resnet_101_caffe_d256d256d2560.7200.9150.8030.6930.7640.7530.9280.8210.7200.802
256x192_pose_resnet_152_caffe_d256d256d2560.7280.9250.8040.7020.7660.7600.9310.8280.7290.806

Note:

  • Flip test is used.
  • Person detector has person AP of 56.4 on COCO val2017 dataset.
  • Difference betweenPyTorch-style andCaffe-style ResNet is the position of stride=2 convolution

Environment

The code is developed using python 3.6 on Ubuntu 16.04. NVIDIA GPUs are needed. The code is developed and tested using 4 NVIDIA P100 GPU cards. Other platforms or GPU cards are not fully tested.

Quick start

Installation

  1. Install pytorch >= v0.4.0 followingofficial instruction.

  2. Disable cudnn for batch_norm:

    # PYTORCH=/path/to/pytorch# for pytorch v0.4.0sed -i "1194s/torch\.backends\.cudnn\.enabled/False/g" ${PYTORCH}/torch/nn/functional.py# for pytorch v0.4.1sed -i "1254s/torch\.backends\.cudnn\.enabled/False/g" ${PYTORCH}/torch/nn/functional.py

    Note that instructions like # PYTORCH=/path/to/pytorch indicate that you should pick a path where you'd like to have pytorch installed and then set an environment variable (PYTORCH in this case) accordingly.

  3. Clone this repo, and we'll call the directory that you cloned as ${POSE_ROOT}.

  4. Install dependencies:

    pip install -r requirements.txt
  5. Make libs:

    cd ${POSE_ROOT}/libmake
  6. InstallCOCOAPI:

    # COCOAPI=/path/to/clone/cocoapigit clone https://github.com/cocodataset/cocoapi.git $COCOAPIcd $COCOAPI/PythonAPI# Install into global site-packagesmake install# Alternatively, if you do not have permissions or prefer# not to install the COCO API into global site-packagespython3 setup.py install --user

    Note that instructions like # COCOAPI=/path/to/install/cocoapi indicate that you should pick a path where you'd like to have the software cloned and then set an environment variable (COCOAPI in this case) accordingly.

  7. Download pytorch imagenet pretrained models frompytorch model zoo and caffe-style pretrained models fromGoogleDrive.

  8. Download mpii and coco pretrained models fromOneDrive orGoogleDrive. Please download them under ${POSE_ROOT}/models/pytorch, and make them look like this:

    ${POSE_ROOT} `-- models     `-- pytorch         |-- imagenet         |   |-- resnet50-19c8e357.pth         |   |-- resnet50-caffe.pth.tar         |   |-- resnet101-5d3b4d8f.pth         |   |-- resnet101-caffe.pth.tar         |   |-- resnet152-b121ed2d.pth         |   `-- resnet152-caffe.pth.tar         |-- pose_coco         |   |-- pose_resnet_101_256x192.pth.tar         |   |-- pose_resnet_101_384x288.pth.tar         |   |-- pose_resnet_152_256x192.pth.tar         |   |-- pose_resnet_152_384x288.pth.tar         |   |-- pose_resnet_50_256x192.pth.tar         |   `-- pose_resnet_50_384x288.pth.tar         `-- pose_mpii             |-- pose_resnet_101_256x256.pth.tar             |-- pose_resnet_101_384x384.pth.tar             |-- pose_resnet_152_256x256.pth.tar             |-- pose_resnet_152_384x384.pth.tar             |-- pose_resnet_50_256x256.pth.tar             `-- pose_resnet_50_384x384.pth.tar
  9. Init output(training model output directory) and log(tensorboard log directory) directory:

    mkdir output mkdir log

    Your directory tree should look like this:

    ${POSE_ROOT}├── data├── experiments├── lib├── log├── models├── output├── pose_estimation├── README.md└── requirements.txt

Data preparation

For MPII data, please download fromMPII Human Pose Dataset. The original annotation files are in matlab format. We have converted them into json format, you also need to download them fromOneDrive orGoogleDrive.Extract them under {POSE_ROOT}/data, and make them look like this:

${POSE_ROOT}|-- data`-- |-- mpii    `-- |-- annot        |   |-- gt_valid.mat        |   |-- test.json        |   |-- train.json        |   |-- trainval.json        |   `-- valid.json        `-- images            |-- 000001163.jpg            |-- 000003072.jpg

For COCO data, please download fromCOCO download, 2017 Train/Val is needed for COCO keypoints training and validation. We also provide person detection result of COCO val2017 to reproduce our multi-person pose estimation results. Please download fromOneDrive orGoogleDrive.Download and extract them under {POSE_ROOT}/data, and make them look like this:

${POSE_ROOT}|-- data`-- |-- coco    `-- |-- annotations        |   |-- person_keypoints_train2017.json        |   `-- person_keypoints_val2017.json        |-- person_detection_results        |   |-- COCO_val2017_detections_AP_H_56_person.json        `-- images            |-- train2017            |   |-- 000000000009.jpg            |   |-- 000000000025.jpg            |   |-- 000000000030.jpg            |   |-- ...             `-- val2017                |-- 000000000139.jpg                |-- 000000000285.jpg                |-- 000000000632.jpg                |-- ...

Valid on MPII using pretrained models

python pose_estimation/valid.py \    --cfg experiments/mpii/resnet50/256x256_d256x3_adam_lr1e-3.yaml \    --flip-test \    --model-file models/pytorch/pose_mpii/pose_resnet_50_256x256.pth.tar

Training on MPII

python pose_estimation/train.py \    --cfg experiments/mpii/resnet50/256x256_d256x3_adam_lr1e-3.yaml

Valid on COCO val2017 using pretrained models

python pose_estimation/valid.py \    --cfg experiments/coco/resnet50/256x192_d256x3_adam_lr1e-3.yaml \    --flip-test \    --model-file models/pytorch/pose_coco/pose_resnet_50_256x192.pth.tar

Training on COCO train2017

python pose_estimation/train.py \    --cfg experiments/coco/resnet50/256x192_d256x3_adam_lr1e-3.yaml

Other Implementations

Citation

If you use our code or models in your research, please cite with:

@inproceedings{xiao2018simple,    author={Xiao, Bin and Wu, Haiping and Wei, Yichen},    title={Simple Baselines for Human Pose Estimation and Tracking},    booktitle = {European Conference on Computer Vision (ECCV)},    year = {2018}}

About

The project is an official implement of our ECCV2018 paper "Simple Baselines for Human Pose Estimation and Tracking(https://arxiv.org/abs/1804.06208)"

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

[8]ページ先頭

©2009-2025 Movatter.jp