
C++ implementation of Proximal Policy Optimization


mhubii/ppo_libtorch


This is an implementation of the proximal policy optimization (PPO) algorithm for the C++ API of PyTorch (libtorch). It uses a simple TestEnvironment to test the algorithm. Below is a small visualization of the environment in which the algorithm is tested.

Fig. 1: The agent in testing mode.

Build

You first need to install PyTorch. For a clean installation from Anaconda, check out this short tutorial, or this tutorial to only install the binaries.

Do

```shell
mkdir build
cd build
cmake -DCMAKE_PREFIX_PATH=/absolute/path/to/libtorch ..
make
```
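The `CMAKE_PREFIX_PATH` flag tells CMake where to find the libtorch installation. A minimal `CMakeLists.txt` for such a build might look like the sketch below, following the pattern from the official libtorch documentation; the source file name is an assumption, not necessarily what this repository uses:

```cmake
cmake_minimum_required(VERSION 3.10)
project(ppo_libtorch)

# CMAKE_PREFIX_PATH (passed on the command line) lets find_package
# locate the libtorch installation.
find_package(Torch REQUIRED)

add_executable(train_ppo train_ppo.cpp)  # source file name is assumed
target_link_libraries(train_ppo "${TORCH_LIBRARIES}")
set_property(TARGET train_ppo PROPERTY CXX_STANDARD 17)
```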

Run

Run the executable with

```shell
cd build
./train_ppo
```

To plot the results, run

```shell
cd ..
python plot.py --online_view --csv_file data/data.csv --epochs 1 10
```

It should produce something like what is shown below.

Fig. 2: From left to right, the agent for successive epochs in training mode as it takes actions in the environment to reach the goal.

Once trained, the algorithm can also be used in test mode. To do so, run

```shell
cd build
./test_ppo
```

To plot the results, run

```shell
cd ..
python plot.py --online_view --csv_file data/data_test.csv --epochs 1
```

Visualization

The results are saved to `data/data.csv` and can be visualized by running `python plot.py`. Run

```shell
python plot.py --help
```

for help.
