sash-a/es_pytorchPublic

NotificationsYou must be signed in to change notification settings
Fork0
Star27

High performance implementation of Deep neuroevolution in pytorch using mpi4py. Intended for use on HPC clusters

You must be signed in to change notification settings

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 188 Commits
configs		configs
src		src
test		test
.gitignore		.gitignore
README.md		README.md
batch_run.py		batch_run.py
env.yml		env.yml
flagrun.py		flagrun.py
multi_agent.py		multi_agent.py
nsra.py		nsra.py
obj.py		obj.py
run_saved.py		run_saved.py
simple_example.py		simple_example.py

Repository files navigation

Depreciated in faviour of mymuch faster implementation in Julia

Evolutionary strategies (deep neuroevolution) in pytorch using MPI

This implementation was made to be as simple and efficient as possible.
Reference implementation can be foundhere (in tensorflow using redis).
Based on two papers by uber AI labshere andhere.

Implementation

This was made for use on a cluster using MPI (however it can be used on a single machine). With regards to efficiency itonly scatters the positive fitness, negative fitness and noise index, per policy evaluated, to all other processes each generation. The noise is placed in a blockof shared memory (on each node) for fast access and low memory footprint.

How to run

conda install:conda install -n es_env -f env.yml
example usages:simple_example.pyobj.pynsra.py
example configs are inconfig/

conda activate es_envmpirun -np {num_procs} python simple_example.py configs/simple_conf.json

Make sure that you insert this line before you create your neural network as the initial creation sets theinitial parameters, which must be deterministic across all threads

torch.random.manual_seed({seed})

General info

In order to define a policy create asrc.nn.nn.BaseNet (which is a simple extension of atorch.nn.Module) andpass it to aPolicy along with ansrc.nn.optimizers.Optimizer and float value for the noise standard deviation, anexample of this can be seen insimple_example.py.
If you wish to share the noise using shared memory and MPI, then instantiate theNoiseTable usingNoiseTable.create_shared(...), otherwise if you wish to use your own method of sharing noise/runningsequentially then simply create the noise table using its constructor and pass your noise to it like this:NoiseTable(my_noise, n_params)
NoiseTable.create_shared(...) will throw an error if less than 2 MPI procs are used

About

High performance implementation of Deep neuroevolution in pytorch using mpi4py. Intended for use on HPC clusters

Releases

No releases published

Packages

No packages published

Languages

Python100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Depreciated in faviour of mymuch faster implementation in Julia

Evolutionary strategies (deep neuroevolution) in pytorch using MPI

Implementation

How to run

General info

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Languages

Movatterモバイル変換

sash-a/es_pytorch

Folders and files

Latest commit

History

Repository files navigation

Depreciated in faviour of mymuch faster implementation in Julia

Evolutionary strategies (deep neuroevolution) in pytorch using MPI

Implementation

How to run

General info

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Languages

Packages