seungeunrho/minimalRL

Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)

License

MIT license

Folders and files

LICENSE
README.md
REINFORCE.py
a2c.py
a3c.py
acer.py
actor_critic.py
ddpg.py
dqn.py
ppo-continuous.py
ppo-lstm.py
ppo.py
sac.py
vtrace.py

minimalRL-pytorch

Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)

Each algorithm is complete within a single file.
Most files are around 100 to 150 lines of code (the shortest listed below is 67 lines, the longest 188).
Every algorithm can be trained within 30 seconds, even without a GPU.
The environment is fixed to "CartPole-v1", so you can focus on the implementations.

Algorithms

REINFORCE (67 lines)
Vanilla Actor-Critic (98 lines)
DQN (112 lines, including replay memory and target network)
PPO (119 lines, including GAE)
DDPG (145 lines, including OU noise and soft target update)
A3C (129 lines)
ACER (149 lines)
A2C (188 lines)
SAC (171 lines) added!!
PPO-Continuous (161 lines) added!!
Vtrace (137 lines) added!!
Any suggestions ...?
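
Two of the ingredients named in the list above, the discounted return used by REINFORCE and the GAE advantage used by PPO, can be sketched in a few lines of plain Python. This is an illustrative sketch and not code taken from the repository files: the function names `discounted_returns` and `gae_advantages`, their parameters, and the default values of `gamma` and `lam` are assumptions chosen for the example, and the repository itself works with PyTorch tensors rather than lists.

```python
from typing import List, Sequence


def discounted_returns(rewards: Sequence[float], gamma: float = 0.99) -> List[float]:
    """Compute G_t = r_t + gamma * G_{t+1} for one finished episode."""
    returns: List[float] = []
    running = 0.0
    # Walk the episode backwards so each return reuses the one after it.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def gae_advantages(
    rewards: Sequence[float],
    values: Sequence[float],
    next_value: float,
    gamma: float = 0.99,
    lam: float = 0.95,
) -> List[float]:
    """Generalized Advantage Estimation over one rollout segment.

    values[t] is the critic's estimate V(s_t); next_value is the estimate
    for the state after the last step (use 0.0 if the episode terminated).
    """
    advantages: List[float] = []
    running = 0.0
    next_v = next_value
    for r, v in zip(reversed(rewards), reversed(values)):
        delta = r + gamma * next_v - v          # one-step TD error
        running = delta + gamma * lam * running  # exponentially weighted sum
        advantages.append(running)
        next_v = v
    advantages.reverse()
    return advantages
```

With `lam=1.0` and an all-zero value estimate, the GAE advantages reduce to the plain discounted returns, which is a convenient sanity check when experimenting with either function.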

Dependencies

PyTorch
OpenAI Gym ( > 0.26.2 IMPORTANT!! Earlier versions are no longer supported)
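
The version requirement matters because Gym changed its environment interface in release 0.26: `reset()` returns an `(observation, info)` pair, and `step()` returns five values with separate `terminated` and `truncated` flags. The sketch below illustrates that calling convention only. `StubCartPole` is a hypothetical stand-in written for this example so that it runs without Gym installed; it does not simulate CartPole physics. In real use the environment would come from `gym.make("CartPole-v1")`.

```python
import random
from typing import Any, Dict, List, Tuple


class StubCartPole:
    """Hypothetical stand-in that only mimics the Gym >= 0.26 call signatures."""

    def __init__(self, max_steps: int = 5) -> None:
        self.max_steps = max_steps
        self.t = 0

    def reset(self) -> Tuple[List[float], Dict[str, Any]]:
        # Newer Gym versions return (observation, info) instead of just observation.
        self.t = 0
        return [0.0, 0.0, 0.0, 0.0], {}

    def step(self, action: int) -> Tuple[List[float], float, bool, bool, Dict[str, Any]]:
        # Newer Gym versions return five values; the old single `done` flag
        # is split into `terminated` (task ended) and `truncated` (time limit).
        self.t += 1
        observation = [0.0, 0.0, 0.0, 0.0]
        reward = 1.0
        terminated = False
        truncated = self.t >= self.max_steps
        return observation, reward, terminated, truncated, {}


def run_episode(env: Any) -> float:
    """Roll out one episode with random actions and return the total reward."""
    observation, info = env.reset()
    total_reward = 0.0
    done = False
    while not done:
        action = random.randint(0, 1)  # CartPole has two discrete actions
        observation, reward, terminated, truncated, info = env.step(action)
        total_reward += reward
        done = terminated or truncated
    return total_reward
```

Code written against the older interface, which unpacks four values from `step()`, fails with an unpacking error on newer versions, which is one reason to check the installed Gym version before running the scripts.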

Usage

# Works only with Python 3.
# e.g.
python3 REINFORCE.py
python3 actor_critic.py
python3 dqn.py
python3 ppo.py
python3 ddpg.py
python3 a3c.py
python3 a2c.py
python3 acer.py
python3 sac.py
