ppo

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

python machine-learning reinforcement-learning deep-learning deep-reinforcement-learning pytorch gym atari actor-critic ale proximal-policy-optimization ppo advantage-actor-critic a2c wandb phasic-policy-gradient

UpdatedJul 8, 2025
Python

udacity /deep-reinforcement-learning

Star5.1k

Repo for the Deep Reinforcement Learning Nanodegree program

reinforcement-learning deep-reinforcement-learning openai-gym pytorch dqn neural-networks reinforcement-learning-algorithms dynamic-programming hill-climbing ddpg cross-entropy openai-gym-solutions pytorch-rl ppo ml-agents rl-algorithms

UpdatedNov 16, 2023
Jupyter Notebook

andri27-ts /Reinforcement-Learning

Star4.7k

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

machine-learning reinforcement-learning qlearning deep-learning deep-reinforcement-learning artificial-intelligence dqn deepmind evolution-strategies ppo a2c policy-gradients

UpdatedJun 30, 2020
Jupyter Notebook

sweetice /Deep-reinforcement-learning-with-pytorch

Star4.6k

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

algorithm deep-learning deep-reinforcement-learning pytorch dqn policy-gradient sarsa resnet a3c reinforce sac alphago actor-critic trpo ppo a2c actor-critic-algorithm td3

UpdatedMar 24, 2023
Python

AI4Finance-Foundation /ElegantRL

Star4.3k

Massively Parallel Deep Reinforcement Learning. 🔥

lightweight reinforcement-learning gae efficient pytorch stable dqn ddpg sac per multiple-gpu ppo a2c td3 model-free-rl drl-pytorch bipedalwalkerhardcore

UpdatedFeb 20, 2026
Python

simoninithomas /Deep_reinforcement_learning_Course

Star3.9k

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

qlearning deep-learning unity tensorflow deep-reinforcement-learning pytorch tensorflow-tutorials deep-q-network actor-critic deep-q-learning ppo a2c

UpdatedMay 2, 2023
Jupyter Notebook

ikostrikov /pytorch-a2c-ppo-acktr-gail

Star3.9k

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

reinforcement-learning deep-learning deep-reinforcement-learning pytorch atari hessian second-order continuous-control actor-critic ale mujoco proximal-policy-optimization ppo advantage-actor-critic a2c acktr natural-gradients roboschool kfac kronecker-factored-approximation

UpdatedMay 29, 2022
Python

ShangtongZhang /DeepRL

Star3.4k

Modularized Implementation of Deep RL Algorithms in PyTorch

deep-reinforcement-learning rainbow pytorch dqn ddpg double-dqn dueling-network-architecture quantile-regression option-critic-architecture deeprl categorical-dqn ppo a2c prioritized-experience-replay option-critic td3

UpdatedApr 16, 2024
Python

XinJingHao /DRL-Pytorch

Star3.3k

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

machine-learning reinforcement-learning asl deep-reinforcement-learning q-learning pytorch ddpg sac double-dqn c51 dueling-dqn categorical-dqn ppo prioritized-experience-replay noisynet-dqn td3

UpdatedJun 11, 2025
Python

seungeunrho /minimalRL

Star3.1k

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

machine-learning reinforcement-learning deep-learning simple deep-reinforcement-learning pytorch dqn a3c reinforce ddpg sac acer ppo a2c policy-gradients

UpdatedApr 22, 2023
Python

AI4Finance-Foundation /FinRL-Trading

Star2.7k

For trading. Please star.

deep-reinforcement-learning openai-gym sharpe-ratio ddpg stock-trading ppo a2c-algorithm ensemble-strategy stock-trading-strategy automated-stock-trading

UpdatedFeb 5, 2026
Python

nikhilbarhate99 /PPO-PyTorch

Star2.3k

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

reinforcement-learning deep-learning deep-reinforcement-learning pytorch policy-gradient reinforcement-learning-algorithms pytorch-tutorial proximal-policy-optimization ppo pytorch-implmention ppo-pytorch

UpdatedJul 9, 2024
Python

marlbenchmark /on-policy

Star1.9k

This is the official implementation of Multi-Agent PPO (MAPPO).

algorithms multi-agent hanabi smac ppo mpes starcraftii mappo

UpdatedJul 18, 2024
Python

kengz /SLM-Lab

Star1.3k

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

benchmark reinforcement-learning deep-reinforcement-learning pytorch dqn policy-gradient a3c sac ppo a2c

UpdatedFeb 20, 2026
Python

Khrylx /PyTorch-RL

Star1.3k

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

reinforcement-learning deep-reinforcement-learning pytorch generative-adversarial-network policy-gradient trpo fisher-vectors pytorch-rl proximal-policy-optimization ppo a2c