hcnoh/rl-collection-pytorch

A collection of Reinforcement Learning implementations with PyTorch


This repository is a collection of the following reinforcement learning algorithms:

  • Policy-Gradient
  • Actor-Critic
  • Trust Region Policy Optimization
  • Generalized Advantage Estimation
  • Proximal Policy Optimization

More algorithms will be added to this repository.
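
As an illustration of the kind of computation these algorithms involve, the following is a minimal sketch of Generalized Advantage Estimation for a single trajectory. It is written here as a standalone example under assumed tensor shapes; it is not the implementation used in this repository, and the function name and arguments are placeholders.

    import torch

    def compute_gae(rewards, values, last_value, gamma=0.99, lam=0.95):
        # rewards: tensor of shape [T]; values: tensor of shape [T] holding V(s_t);
        # last_value: V(s_T) used to bootstrap the final step.
        # Termination handling (done flags) is omitted for brevity.
        T = rewards.shape[0]
        advantages = torch.zeros(T)
        gae = 0.0
        next_value = last_value
        for t in reversed(range(T)):
            # TD error: delta_t = r_t + gamma * V(s_{t+1}) - V(s_t)
            delta = rewards[t] + gamma * next_value - values[t]
            # GAE recursion: A_t = delta_t + gamma * lambda * A_{t+1}
            gae = delta + gamma * lam * gae
            advantages[t] = gae
            next_value = values[t]
        return advantages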

In this repository, OpenAI Gym environments such as CartPole-v0, Pendulum-v0, and BipedalWalker-v3 are used. You need to install them before running the code in this repository.

Note: The environment names may differ depending on your version of OpenAI Gym.
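
If you have not used these environments before, the following minimal sketch shows how one of them can be created and stepped with random actions just to verify the installation. It assumes the classic (pre-0.26) Gym API, in which reset() returns an observation and step() returns four values; newer Gym or Gymnasium releases change both the API and some environment names.

    import gym

    # Create one of the environments used in this repository.
    env = gym.make("CartPole-v0")

    obs = env.reset()
    done = False
    total_reward = 0.0
    while not done:
        # Sample a random action just to check that the environment runs.
        action = env.action_space.sample()
        obs, reward, done, info = env.step(action)
        total_reward += reward
    print("episode return:", total_reward)
    env.close()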

Install Dependencies

  1. Install Python 3.

  2. Install the Python packages in requirements.txt. If you are using a virtual environment for Python package management, you can install all the required packages with the following bash command:

    $ pip install -r requirements.txt
  3. Install any other packages needed to run the OpenAI Gym environments. These depend on your machine's development environment.

  4. Install PyTorch. The PyTorch version should be greater than or equal to 1.7.0. A quick version check such as the sketch below can confirm this.
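
As a quick sanity check after installation, a short snippet like the one below can confirm that PyTorch and Gym import correctly and that the PyTorch version satisfies the requirement; treat it only as an illustrative check.

    import gym
    import torch

    print("PyTorch:", torch.__version__)
    print("Gym:", gym.__version__)

    # This repository requires PyTorch >= 1.7.0.
    major, minor = (int(x) for x in torch.__version__.split(".")[:2])
    assert (major, minor) >= (1, 7), "PyTorch 1.7.0 or later is required."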

Training and Running

  1. Modify config.json to match your machine's settings. A sketch of how config.json and the command-line options below typically fit together appears after this list.

  2. Execute the training process with train.py. An example usage of train.py is as follows:

    $ python train.py --model_name=trpo --env_name=BipedalWalker-v3

    The following bash command shows the available options:

    $ python train.py -h
  3. You can run your pre-trained agents by executing run.py. Its usage is similar to that of train.py. You can also check the help message with the following bash command:

    $ python run.py -h
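
The sketch below illustrates how an entry point such as train.py typically wires config.json together with the command-line options shown above; the config keys, defaults, and structure here are hypothetical placeholders and may not match the actual files in this repository.

    import argparse
    import json

    # Load machine-specific settings; the real keys in config.json may differ.
    with open("config.json") as f:
        config = json.load(f)

    parser = argparse.ArgumentParser()
    parser.add_argument("--model_name", type=str, default="trpo",
                        help="algorithm to train, e.g. trpo or ppo")
    parser.add_argument("--env_name", type=str, default="BipedalWalker-v3",
                        help="OpenAI Gym environment name")
    args = parser.parse_args()

    print("Training", args.model_name, "on", args.env_name, "with config:", config)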

The results of the CartPole environment

The results of the Pendulum environment

The results of the BipedalWalker environment

Recent Works

  • CUDA support is now available.
  • Fixed some errors in GAE and PPO.
  • Fixed some errors related to the horizon.

Future Works

  • Find and fix the errors in the Actor-Critic implementation
  • Implement ACER
  • Explore other environments for running the algorithms

References

  • An explanation of the TRPO line search: link
  • An additional stability method for the PPO value function: link
