CMU-Perceptual-Computing-Lab/caffe_train

This is our modified version of Caffe for training the multi-person pose estimator. It is based on the Caffe version from July 2016. The repository is known to run on at least Ubuntu 14.04, OpenCV 2.4.10, CUDA 7.5/8.0, and cuDNN 5.

The full project repo includes detailed training steps and the testing code in MATLAB, C++, and Python.

We add a customized Caffe layer for data augmentation, cpm_data_transformer.cpp, which includes scale augmentation (e.g., in the range of 0.7 to 1.3), rotation augmentation (e.g., in the range of -40 to 40 degrees), flip augmentation, and image cropping. This augmentation strategy makes the method capable of dealing with a large range of scales and orientations. You can set the augmentation parameters in setLayers.py. Example data layer parameters in the training prototxt are:

    layer {
      name: "data"
      type: "CPMData"
      top: "data"
      top: "label"
      data_param {
        source: "/home/zhecao/COCO_kpt/lmdb_trainVal"
        batch_size: 10
        backend: LMDB
      }
      cpm_transform_param {
        stride: 8
        max_rotate_degree: 40
        visualize: false
        crop_size_x: 368
        crop_size_y: 368
        scale_prob: 1
        scale_min: 0.5
        scale_max: 1.1
        target_dist: 0.6
        center_perterb_max: 40
        do_clahe: false
        num_parts: 56
        np_in_lmdb: 17
      }
    }
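To make the role of these parameters more concrete, here is a minimal Python sketch of how one set of augmentation values might be drawn from the cpm_transform_param block above. It is not the code in cpm_data_transformer.cpp. The helper names (TransformParams, sample_augmentation, label_grid_size), the flip_prob value, the uniform sampling, and the way target_dist normalizes the person size are all assumptions made for illustration.

```python
import random
from dataclasses import dataclass


@dataclass
class TransformParams:
    # Values copied from the example cpm_transform_param block.
    stride: int = 8
    max_rotate_degree: float = 40.0
    crop_size_x: int = 368
    crop_size_y: int = 368
    scale_prob: float = 1.0
    scale_min: float = 0.5
    scale_max: float = 1.1
    target_dist: float = 0.6
    center_perterb_max: float = 40.0
    flip_prob: float = 0.5  # assumption: not set in the example block


def sample_augmentation(p, person_scale, rng):
    """Draw one set of augmentation values (hypothetical helper)."""
    # With probability scale_prob, pick a multiplier in [scale_min, scale_max];
    # otherwise leave the scale unchanged.
    if rng.random() < p.scale_prob:
        multiplier = rng.uniform(p.scale_min, p.scale_max)
    else:
        multiplier = 1.0
    # Assumption: target_dist normalizes the annotated person size
    # before the random multiplier is applied.
    scale = p.target_dist / person_scale * multiplier
    return {
        "scale": scale,
        "rotate_degree": rng.uniform(-p.max_rotate_degree, p.max_rotate_degree),
        "center_offset": (
            rng.uniform(-p.center_perterb_max, p.center_perterb_max),
            rng.uniform(-p.center_perterb_max, p.center_perterb_max),
        ),
        "flip": rng.random() < p.flip_prob,
    }


def label_grid_size(p):
    """Spatial size of the label maps, assuming crop size divided by stride."""
    return p.crop_size_x // p.stride, p.crop_size_y // p.stride


if __name__ == "__main__":
    params = TransformParams()
    print(sample_augmentation(params, person_scale=0.6, rng=random.Random(0)))
    print(label_grid_size(params))
```

Under these assumptions, a 368 x 368 crop with stride 8 corresponds to a 46 x 46 label grid, and a person annotated at scale 0.6 is resized by a factor between scale_min and scale_max.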

This project is licensed under the terms of the GPL v3 license. We will merge it with the Caffe testing version (https://github.com/CMU-Perceptual-Computing-Lab/caffe_rtpose) later.

Citation

Please cite the paper in your publications if it helps your research:

    @article{cao2016realtime,
      title={Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields},
      author={Zhe Cao and Tomas Simon and Shih-En Wei and Yaser Sheikh},
      journal={arXiv preprint arXiv:1611.08050},
      year={2016}
    }

    @inproceedings{wei2016cpm,
      author = {Shih-En Wei and Varun Ramakrishna and Takeo Kanade and Yaser Sheikh},
      booktitle = {CVPR},
      title = {Convolutional pose machines},
      year = {2016}
    }

Caffe

Caffe is a deep learning framework made with expression, speed, and modularity in mind. It is developed by the Berkeley Vision and Learning Center (BVLC) and community contributors.

Check out the project site for all the details and step-by-step examples.

Join the chat at https://gitter.im/BVLC/caffe

Please join the caffe-users group or gitter chat to ask questions and talk about methods and models. Framework development discussions and thorough bug reports are collected on Issues.

Happy brewing!

License and Citation

Caffe is released under the BSD 2-Clause license. The BVLC reference models are released for unrestricted use.

Please cite Caffe in your publications if it helps your research:

    @article{jia2014caffe,
      Author = {Jia, Yangqing and Shelhamer, Evan and Donahue, Jeff and Karayev, Sergey and Long, Jonathan and Girshick, Ross and Guadarrama, Sergio and Darrell, Trevor},
      Journal = {arXiv preprint arXiv:1408.5093},
      Title = {Caffe: Convolutional Architecture for Fast Feature Embedding},
      Year = {2014}
    }
