proximal-policy-optimization
Here are 232 public repositories matching this topic...
Language:All
Sort:Most stars
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
- Updated
Mar 31, 2024 - Python
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
- Updated
Apr 8, 2025 - Python
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
- Updated
Jun 19, 2025 - Python
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
- Updated
May 29, 2022 - Python
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
- Updated
Jul 9, 2024 - Python
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
- Updated
Feb 9, 2021 - Python
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
- Updated
Jul 24, 2021 - Python
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
- Updated
Jan 16, 2021 - Python
A PyTorch library for building deep reinforcement learning agents.
- Updated
Mar 17, 2024 - Python
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
- Updated
Nov 11, 2017 - Python
PyTorch C++ Reinforcement Learning
- Updated
May 3, 2020 - C++
Deep Reinforcement Learning (PPO) in Autonomous Driving (Carla) [from scratch]
- Updated
Feb 6, 2024 - Python
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
- Updated
Nov 19, 2022 - Jupyter Notebook
Deep Reinforcement Learning in C#
- Updated
Jul 4, 2025 - C#
Trading Environment(OpenAI Gym) + PPO(TensorForce)
- Updated
Dec 8, 2022 - Python
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
- Updated
Oct 5, 2022 - Python
Clean baseline implementation of PPO using an episodic TransformerXL memory
- Updated
Jun 18, 2024 - Python
PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research papers.
- Updated
Jun 24, 2025 - Python
Baseline implementation of recurrent PPO using truncated BPTT
- Updated
Apr 28, 2024 - Jupyter Notebook
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
- Updated
Nov 15, 2021 - Python
Improve this page
Add a description, image, and links to theproximal-policy-optimization topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with theproximal-policy-optimization topic, visit your repo's landing page and select "manage topics."