MathPhysSim/FERMI_RL_Paper


The repo for the FERMI FEL paper using model-based and model-free reinforcement learning methods to solve a particle accelerator operation problem.

License

MIT license

Folders and files

Data_Experiments
Figures
bst
tex
AEDYNA.py
AEDYNA_on_dummy_fel_simulation.py
AE_Dyna_Tensorflow_2.py
LICENSE.txt
README.md
SAC_TFlayers.py
inverted_pendulum.py
local_fel_simulated_env.py
main.aux
main.log
main.out
main.pdf
main.synctex.gz
main.tex
mainNotes.bib
naf2_new.py
read_naf_tests.py
read_paper_tests.py
run_aedyna_noise_test_pendulum.py
run_naf2.py
run_naf2_for_tests.py
run_paper_naf_tests.py
simulated_tango.py
utilities.py


Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FEL

Contact: simon.hirlaender(at)sbg.ac.at

Pre-print: https://arxiv.org/abs/2012.09737

Please cite code as:

The included scripts:

To run the NAF2 as used in the paper on the pendulum, run: run_naf2.py
To run the AE-DYNA as used in the paper on the pendulum, run: AEDYNA.py
To run the AE-DYNA with TensorFlow 2 on the pendulum, run: AE_Dyna_Tensorflow_2.py

The rest should be straightforward; otherwise, contact us.

These are the results of RL tests @FERMI-FEL

The problem has four degrees of freedom in state and action space. A schematic overview:

Algorithm	Type	Representational power	Noise resistant	Sample efficiency
NAF	Model-free	Low	No	High
NAF2	Model-free	Low	Yes	High
ME-TRPO	Model-based	High	No	High
AE-DYNA	Model-based	High	Yes	High
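
To make the four-dimensional state and action space concrete, here is a minimal sketch of a toy environment with the same dimensions. It is not the FERMI FEL and not the environment shipped in this repo: the class name `ToyFourDofEnv`, the Gaussian-shaped reward, the bounds, and the termination threshold are all assumptions chosen for illustration.

```python
import numpy as np


class ToyFourDofEnv:
    """Hypothetical stand-in for a 4-D control problem (not the FERMI FEL)."""

    def __init__(self, max_steps=10, seed=0):
        self.rng = np.random.default_rng(seed)
        self.max_steps = max_steps
        self.state = np.zeros(4)
        self.steps = 0

    def reset(self):
        # Start from a random offset in [-1, 1]^4.
        self.state = self.rng.uniform(-1.0, 1.0, size=4)
        self.steps = 0
        return self.state.copy()

    def step(self, action):
        # Actions are bounded increments applied to the four state variables.
        action = np.clip(np.asarray(action, dtype=float), -1.0, 1.0)
        self.state = np.clip(self.state + action, -1.0, 1.0)
        self.steps += 1
        # Assumed reward: a Gaussian-shaped "intensity" that peaks at the
        # origin, shifted so that the reward lies in (-1, 0].
        reward = float(np.exp(-np.sum(self.state ** 2)) - 1.0)
        # Assumed termination: step budget exhausted or close to the optimum.
        done = self.steps >= self.max_steps or reward > -0.05
        return self.state.copy(), reward, done, {}


env = ToyFourDofEnv()
start = env.reset()
# Undoing the initial offset in a single step reaches the optimum.
next_state, reward, done, _ = env.step(-start)
```

An agent such as NAF or AE-DYNA would interact with an environment of this shape through `reset` and `step`; the real machine interface is of course different.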

Experiments done on the machine:

A new implementation of the NAF with double Q learning (single network dashed, double network solid):
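
As background for the single versus double network comparison, the following sketch shows the quadratic Q-function that NAF is built on, together with one common way of using two networks: taking the smaller of the two next-state value estimates in the bootstrapped target. Whether `naf2_new.py` does exactly this is an assumption; the function names and the discount factor are illustrative only.

```python
import numpy as np


def naf_q_value(action, mu, L, v):
    """Q(s, a) = V(s) - 0.5 * (a - mu)^T P (a - mu), with P = L L^T."""
    P = L @ L.T  # positive semi-definite by construction
    diff = action - mu
    return v - 0.5 * diff @ P @ diff


def double_q_target(reward, done, v_next_1, v_next_2, gamma=0.99):
    """Bootstrapped target using the smaller of two next-state values.

    In NAF the maximum of Q over actions equals V, so the target only
    needs the state value of the next state.
    """
    v_next = min(v_next_1, v_next_2)
    return reward + gamma * (1.0 - float(done)) * v_next


# Four-dimensional action, matching the problem size described above.
mu = np.zeros(4)
L = np.eye(4)
q_at_optimum = naf_q_value(mu, mu, L, v=1.0)          # equals V(s)
q_off_optimum = naf_q_value(np.ones(4), mu, L, v=1.0)  # penalised quadratically
target = double_q_target(1.0, False, 2.0, 3.0, gamma=0.5)
```

Taking the minimum makes the target pessimistic, which tends to counteract overestimation of the value when observations are noisy.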

A new implementation of AE-DYNA:

A variant of the ME-TRPO:

The evolution as presented at GSI, Towards Artificial Intelligence in Accelerator Operation:

Experiments done on the inverted pendulum OpenAI Gym environment:

Cumulative reward of different NAF implementations on the inverted pendulum with artificial noise.

Comparison of the inclusion of aleatoric noise in the AE-DYNA in the noisy inverted pendulum:
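
For readers unfamiliar with the term, the sketch below shows a generic way an ensemble of dynamics models can separate aleatoric noise (noise in the data itself) from epistemic uncertainty (disagreement between ensemble members). This is a standard decomposition for ensembles and is given only as an illustration; it is an assumption that it resembles what `AEDYNA.py` does, and `combine_ensemble` is a hypothetical helper.

```python
import numpy as np


def combine_ensemble(means, variances):
    """Combine per-member Gaussian predictions of an ensemble.

    means, variances: arrays of shape (n_members, n_outputs).
    Returns the ensemble mean, the aleatoric part (average predicted
    noise variance) and the epistemic part (variance of member means).
    """
    means = np.asarray(means, dtype=float)
    variances = np.asarray(variances, dtype=float)
    mean = means.mean(axis=0)
    aleatoric = variances.mean(axis=0)
    epistemic = means.var(axis=0)
    return mean, aleatoric, epistemic


# Two members predicting a single output with different means and noise.
mean, aleatoric, epistemic = combine_ensemble(
    means=[[1.0], [3.0]],
    variances=[[0.5], [1.5]],
)
total_variance = aleatoric + epistemic
```

A model that ignores the aleatoric part treats all spread in the data as something more samples could remove, which is misleading on a noisy system.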

Sample efficiency of NAF and AE-DYNA:

Free run on the inverted pendulum:

Update of AE-Dyna-(SAC) to Tensorflow 2

Finally, AE-DYNA has been updated to use TensorFlow 2. Run the script AE_Dyna_Tensorflow_2.py. It is based on tensorlayer, which has to be installed. The script runs on the inverted pendulum and produces results like those shown in the figure below.

If you have questions, do not hesitate to contact us.


Releases: 2 (latest: Preprint release, Dec 18, 2020)
