Spatially Adaptive Computation Time for Residual Networks
This code implements a deep learning architecture based on Residual Networks that dynamically adjusts the number of executed layers for different regions of the image. The architecture is end-to-end trainable, deterministic and problem-agnostic. The included code applies it to the CIFAR-10 and ImageNet image classification problems. It is implemented using TensorFlow and TF-Slim.
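At the core of the architecture is the Adaptive Computation Time (ACT) halting rule of Graves (2016), which SACT applies independently at every spatial position. Below is a minimal pure-Python sketch of the per-position rule; the repository's actual implementation is a vectorized TensorFlow version, so names and details here are illustrative only.

```python
def act_halt(halting_probs, eps=0.01):
    """ACT halting rule for one spatial position (illustrative sketch).

    halting_probs: per-unit halting scores h_1..h_L, each in [0, 1].
    Execution stops at the first unit N where h_1 + ... + h_N >= 1 - eps,
    or at the last unit. Returns (units_executed, remainder, ponder_cost).
    """
    cumulative = 0.0
    for n, h in enumerate(halting_probs, start=1):
        previous = cumulative
        cumulative += h
        if cumulative >= 1.0 - eps or n == len(halting_probs):
            # The remainder makes the per-position output weights sum to one;
            # the ponder cost N + R is what the tau penalty discourages.
            remainder = 1.0 - previous
            return n, remainder, n + remainder

# A position with large halting scores stops early (cheap);
# one with small scores runs more residual units (expensive).
print(act_halt([0.5, 0.6, 0.1]))
print(act_halt([0.1, 0.2, 0.3, 0.2]))
```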
Paper describing the project:
Michael Figurnov, Maxwell D. Collins, Yukun Zhu, Li Zhang, Jonathan Huang, Dmitry Vetrov, Ruslan Salakhutdinov. Spatially Adaptive Computation Time for Residual Networks. CVPR 2017 [arXiv].
*(Figure: an image with detections alongside its ponder cost map.)*
Install prerequisites:

```shell
pip install -r requirements.txt      # CPU
pip install -r requirements-gpu.txt  # GPU
```
Prerequisite packages:
- Python 2.x/3.x (mostly tested with Python 2.7)
- TensorFlow 1.0
- NumPy
- (Optional) nose
- (Optional) h5py
- (Optional) matplotlib
Run tests. It takes a couple of minutes:
```shell
nosetests --logging-level=WARNING
```
Download and convert CIFAR-10 dataset:
```shell
PYTHONPATH=external python external/download_and_convert_cifar10.py --dataset_dir="${HOME}/tensorflow/data/cifar10"
```
Let's train and continuously evaluate a CIFAR-10 Adaptive Computation Time model with five residual units per block (ResNet-32):
```shell
export ACT_LOGDIR='/tmp/cifar10_resnet_5_act_1e-2'
python cifar_main.py --model_type=act --model=5 --tau=0.01 --train_log_dir="${ACT_LOGDIR}/train" --save_summaries_secs=300 &
python cifar_main.py --model_type=act --model=5 --tau=0.01 --checkpoint_dir="${ACT_LOGDIR}/train" --eval_dir="${ACT_LOGDIR}/eval" --mode=eval
```
Or, for spatially adaptive computation time (SACT):
```shell
export SACT_LOGDIR='/tmp/cifar10_resnet_5_sact_1e-2'
python cifar_main.py --model_type=sact --model=5 --tau=0.01 --train_log_dir="${SACT_LOGDIR}/train" --save_summaries_secs=300 &
python cifar_main.py --model_type=sact --model=5 --tau=0.01 --checkpoint_dir="${SACT_LOGDIR}/train" --eval_dir="${SACT_LOGDIR}/eval" --mode=eval
```
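In both cases `--tau` weights the ponder-cost penalty added to the classification loss, trading accuracy for computation. Schematically (the variable names below are illustrative, not the repository's):

```python
def act_training_loss(cross_entropy, mean_ponder_cost, tau=0.01):
    # tau controls the accuracy/computation trade-off:
    # a larger tau penalizes computation more heavily,
    # pushing the model to execute fewer residual units.
    return cross_entropy + tau * mean_ponder_cost

print(act_training_loss(0.5, 3.2, tau=0.01))
```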
To download and evaluate a pretrained ResNet-32 SACT model (1.8 MB file):

```shell
mkdir -p models && curl https://s3.us-east-2.amazonaws.com/sact-models/cifar10_resnet_5_sact_1e-2.tar.gz | tar xv -C models
python cifar_main.py --model_type=sact --model=5 --tau=0.01 --checkpoint_dir='models/cifar10_resnet_5_sact_1e-2' --mode=eval --eval_dir='/tmp' --evaluate_once
```
This model is expected to achieve an accuracy of 91.82%, with the output looking like so:
```
eval/Accuracy[0.9182]
eval/Mean Loss[0.59591407]
Total Flops/mean[82393168]
Total Flops/std[7588926]
...
```
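The metrics are printed as concatenated `name[value]` pairs. A small helper (not part of the repository) to turn such a line into a dictionary of floats:

```python
import re

def parse_metrics(line):
    """Parse concatenated 'name[value]' pairs into a dict of floats."""
    return {name: float(value)
            for name, value in re.findall(r'([^\[\]]+)\[([^\]]+)\]', line)}

metrics = parse_metrics(
    'eval/Accuracy[0.9182]eval/Mean Loss[0.59591407]'
    'Total Flops/mean[82393168]Total Flops/std[7588926]')
print(metrics['eval/Accuracy'])  # 0.9182
```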
Follow the instructions to prepare the ImageNet dataset in TF-Slim format. The default directory for the dataset is `~/tensorflow/imagenet`. You can change it with the `--dataset_dir` flag.
We initialized all ACT/SACT models with a pretrained ResNet-101 model (159 MB file).
Download the pretrained ResNet-101 SACT model, trained with tau=0.005 (160 MB file):

```shell
mkdir -p models && curl https://s3.us-east-2.amazonaws.com/sact-models/imagenet_101_sact_5e-3.tar.gz | tar xv -C models
```
Evaluate the pretrained model:

```shell
python imagenet_eval.py --model_type=sact --model=101 --tau=0.005 --checkpoint_dir=models/imagenet_101_sact_5e-3 --eval_dir=/tmp --evaluate_once
```
Expected output:
```
eval/Accuracy[0.75609803]
eval/Recall@5[0.9274632117722329]
Total Flops/mean[1.1100941e+10]
Total Flops/std[4.5691142e+08]
...
```
Note that evaluation on the full validation dataset will take some time using only a CPU. Add the arguments `--num_examples=10 --batch_size=10` for a quicker test.
Draw some images from the ImageNet validation set and the corresponding ponder cost maps:

```shell
python imagenet_export.py --model_type=sact --model=101 --tau=0.005 --checkpoint_dir=models/imagenet_101_sact_5e-3 --export_path=/tmp/maps.h5 --batch_size=1 --num_examples=200
mkdir /tmp/maps
python draw_ponder_maps.py --input_file=/tmp/maps.h5 --output_dir=/tmp/maps
```
Example visualizations. See Figure 9 of the paper for more.
*(Figures: images and the corresponding ponder cost maps.)*
Apply the pretrained model to your own JPEG images. For best results, first resize them to somewhere between 320x240 and 640x480.
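One way to pick a target size in that range while preserving aspect ratio (a hypothetical helper, not part of the repository; the actual resizing can then be done with any image tool):

```python
def resize_target(width, height, lower=(320, 240), upper=(640, 480)):
    """Scale (width, height) to roughly fit the suggested 320x240-640x480 range.

    Shrinks images that exceed the upper bound and enlarges images below
    the lower bound; images with unusual aspect ratios may still fall
    slightly outside the range on one side.
    """
    # Scale down so the image fits inside the upper bound (never upscale here).
    scale = min(upper[0] / float(width), upper[1] / float(height), 1.0)
    # Scale up if the image is smaller than the lower bound.
    scale = max(scale, lower[0] / float(width), lower[1] / float(height))
    return int(round(width * scale)), int(round(height * scale))

print(resize_target(1280, 960))  # (640, 480)
print(resize_target(160, 120))   # (320, 240)
```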
```shell
python2 imagenet_ponder_map.py --model=101 --checkpoint_dir=models/imagenet_101_sact_5e-3 --images_pattern=pics/gasworks.jpg --output_dir output/
```
*(Figures: images, the corresponding ponder cost maps, and a colorbar.)*
Note that an ImageNet-pretrained model tends to ignore people - there is no "person" class in ImageNet!
This is not an official Google product.