Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

The repo for the FERMI FEL paper using model-based and model-free reinforcement learning methods to solve a particle accelerator operation problem.

License

NotificationsYou must be signed in to change notification settings

MathPhysSim/FERMI_RL_Paper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Contact: simon.hirlaender(at)sbg.ac.at

Pre-printhttps://arxiv.org/abs/2012.09737

Please cite code as:

DOI

The included scripts:

  1. To run the NAF2 as used in the paper on the pendulum run:run_naf2.py
  2. To run the AE-DYNA as used in the paper on the pendulum run:AEDYNA.py
  3. To run the AE-DYNA with tensorflow 2 on the pendulum run:AE_Dyna_Tensorflow_2.py

The rest should be straight forward, otherwise contact us.

These are the results of RL tests @FERMI-FEL

The problem has four degrees of freedom in state and action space.A schematic overview:

SchemaFERMIFEL

AlgorithmTypeRepresentational powerNoise resistiveSample efficiency
NAFModel-freeLowNoHigh
NAF2Model-freeLowYesHigh
ME-TRPOModel-basedHighNoHigh
AE-DYNAModel-basedHighYesHigh

Experiments done on the machine:

A new implementation of the NAF with double Q learning (single network dashed, double network solid):

NAF2_training

NAF2_training

A new implementation of aAE-DYNA:

AE-DYNA

AE-DYNA

A variant of theME-TRPO:

ME-TRPO

ME-TRPO

ME-TRPO

Experiments done on theinverted pendulum openai gym environment:

Cumulative reward of differentNAF implementations on theinverted pendulum with artificial noise.

NAF_NOISE

Comparison of the inclusion of aleatoric noise in the AE-DYNA in the noisyinverted pendulum:

AE-DYNA_NOISE

Comparison of the inclusion of aleatoric noise in the AE-DYNA in the noisyinverted pendulum:

AE-DYNA_NOISE

Sample efficiency ofNAF andAE-DYNA:

AE-DYNA

Free run on theinverted pendulum:

AE-DYNA

Update of AE-Dyna-(SAC) to Tensorflow 2

Finally, there is an update of the AE-dyna to use tensorflow 2. Run the scriptAE_Dyna_Tensorflow_2.py.It is based on tensor_layerstensorlayer, which has to be installed.The scriptAE_Dyna_Tensorflow_2.py runs on the inverted pendulum and produces results like shown in the figure below.

img.pngIf you have questions do not hesitate to contact us.


[8]ページ先頭

©2009-2025 Movatter.jp