XifengGuo/CapsNet-KerasPublic

NotificationsYou must be signed in to change notification settings
Fork648
Star2.5k

A Keras implementation of CapsNet in NIPS2017 paper "Dynamic Routing Between Capsules". Now test error ＝ 0.34%.

License

MIT license

2.5k stars 648 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
result		result
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
capsulelayers.py		capsulelayers.py
capsulenet-multi-gpu.py		capsulenet-multi-gpu.py
capsulenet.py		capsulenet.py
utils.py		utils.py

Repository files navigation

CapsNet-Keras

A Keras (branchtf2.2 supports TensorFlow 2) implementation of CapsNet in the paper:
Sara Sabour, Nicholas Frosst, Geoffrey E Hinton. Dynamic Routing Between Capsules. NIPS 2017
The currentaverage test error = 0.34% andbest test error = 0.30%.

Differences with the paper:

We use the learning rate decay withdecay factor = 0.9 andstep = 1 epoch,
while the paper did not give the detailed parameters (or they didn't use it?).
We only report the test errors after50 epochs training.
In the paper, I suppose they trained for1250 epochs according to Figure A.1?Sounds crazy, maybe I misunderstood.
We use MSE (mean squared error) as the reconstruction loss andthe coefficient for the loss islam_recon=0.0005*784=0.392.
This should beequivalent with using SSE (sum squared error) andlam_recon=0.0005 as in the paper.

Warnning

Please use Keras==2.0.7 with TensorFlow==1.2 backend, or theK.batch_dot function may not work correctly.

However, if you useTensorflow>=2.0, then checkout branchtf2.2

Usage

Step 1. Clone this repository to local.

git clone https://github.com/XifengGuo/CapsNet-Keras.git capsnet-kerascd capsnet-kerasgit checkout tf2.2 # Only if use Tensorflow>=2.0

Step 2.Install Keras==2.0.7 with TensorFlow==1.2 backend.

pip install tensorflow-gpu==1.2pip install keras==2.0.7

or install Tensorflow>=2.0

pip install tensorflow==2.2

Step 3. Train a CapsNet on MNIST

Training with default settings:

python capsulenet.py

More detailed usage run for help:

python capsulenet.py -h

Step 4. Test a pre-trained CapsNet model

Suppose you have trained a model using the above command, then the trained model will besaved toresult/trained_model.h5. Now just launch the following command to get test results.

$ python capsulenet.py -t -w result/trained_model.h5

It will output the testing accuracy and show the reconstructed images.The testing data is same as the validation data. It will be easy to test on new data,just change the code as you want.

You can also justdownload a model I trained fromhttps://pan.baidu.com/s/1sldqQo1orhttps://drive.google.com/open?id=1A7pRxH7iWzYZekzr-O0nrwqdUUpUpkik

Step 5. Train on multi gpus

This requiresKeras>=2.0.9. After updating Keras:

python capsulenet-multi-gpu.py --gpus 2

It will automatically train on multi gpus for 50 epochs and then output the performance on test dataset.But during training, no validation accuracy is reported.

Results

Test Errors

CapsNet classification testerror on MNIST. Average and standard deviation results arereported by 3 trials. The results can be reproduced by launching the following commands.

python capsulenet.py --routings 1 --lam_recon 0.0    #CapsNet-v1   python capsulenet.py --routings 1 --lam_recon 0.392  #CapsNet-v2python capsulenet.py --routings 3 --lam_recon 0.0    #CapsNet-v3 python capsulenet.py --routings 3 --lam_recon 0.392  #CapsNet-v4

Method	Routing	Reconstruction	MNIST (%)	Paper
Baseline	--	--	--	0.39
CapsNet-v1	1	no	0.39 (0.024)	0.34 (0.032)
CapsNet-v2	1	yes	0.36 (0.009)	0.29 (0.011)
CapsNet-v3	3	no	0.40 (0.016)	0.35 (0.036)
CapsNet-v4	3	yes	0.34 (0.016)	0.25 (0.005)

Losses and accuracies:

Training Speed

About100s / epoch on a single GTX 1070 GPU.
About80s / epoch on a single GTX 1080Ti GPU.
About55s / epoch on two GTX 1080Ti GPU by usingcapsulenet-multi-gpu.py.

Reconstruction result

The result of CapsNet-v4 by launching

python capsulenet.py -t -w result/trained_model.h5

Digits at top 5 rows are real images from MNIST anddigits at bottom are corresponding reconstructed images.

Manipulate latent code

python capsulenet.py -t --digit 5 -w result/trained_model.h5

For each digit, theith row corresponds to theith dimension of the capsule, and columns from left toright correspond to adding[-0.25, -0.2, -0.15, -0.1, -0.05, 0, 0.05, 0.1, 0.15, 0.2, 0.25] tothe value of one dimension of the capsule.

As we can see, each dimension has caught some characteristics of a digit. The same dimension ofdifferent digit capsules may represent different characteristics. This is because that differentdigits are reconstructed from different feature vectors (digit capsules). These vectors are mutuallyindependent during reconstruction.

Other Implementations

PyTorch:
TensorFlow:
- naturomics/CapsNet-Tensorflow
  I referred to some functions in this repository.
- InnerPeace-Wu/CapsNet-tensorflow
- chrislybaer/capsules-tensorflow
MXNet:
- AaronLeong/CapsNet_Mxnet
Chainer:
- soskek/dynamic_routing_between_capsules
Matlab:
- yechengxi/LightCapsNet

About

A Keras implementation of CapsNet in NIPS2017 paper "Dynamic Routing Between Capsules". Now test error ＝ 0.34%.

Languages

Python100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

CapsNet-Keras

Warnning

Usage

Results

Test Errors

Training Speed

Reconstruction result

Manipulate latent code

Other Implementations

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Contributors4

Uh oh!

Languages

Movatterモバイル変換

License

XifengGuo/CapsNet-Keras

Folders and files

Latest commit

History

Repository files navigation

CapsNet-Keras

Warnning

Usage

Results

Test Errors

Training Speed

Reconstruction result

Manipulate latent code

Other Implementations

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Contributors4

Uh oh!

Languages

Packages