- Notifications
You must be signed in to change notification settings - Fork20
mhubii/ppo_libtorch
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
This is an implementation of theproximal policy optimization algorithm for the C++ API of Pytorch. It uses a simpleTestEnvironment to test the algorithm. Below is a small visualization of the environment, the algorithm is tested in.
You first need to install PyTorch. For a clean installation from Anaconda, checkout this shorttutorial, or thistutorial, to only install the binaries.
Do
mkdir buildcd buildcmake -DCMAKE_PREFIX_PATH=/absolut/path/to/libtorch ..makeRun the executable with
cd build./train_ppoTo plot the results, run
cd ..python plot.py --online_view --csv_file data/data.csv --epochs 1 10It should produce something like shown below.
The algorithm can also be used in test mode, once trained. Therefore, run
cd build./test_ppoTo plot the results, run
cd ..python plot.py --online_view --csv_file data/data_test.csv --epochs 1The results are saved todata/data.csv and can be visualized by runningpython plot.py. Run
python plot.py --help
for help.
About
C++ implementation of Proximal Policy Optimization
Resources
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Releases
Packages0
Uh oh!
There was an error while loading.Please reload this page.


