Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
jimkon/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Link to paper
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym.
The paper introduces the Wolpertinger training algorithm, which extends the Deep Deterministic Policy Gradient (DDPG) training algorithm introduced in this paper.
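As a rough illustration of the action-selection idea behind Wolpertinger (the actor proposes a continuous proto-action, the k nearest valid discrete actions are looked up, and the critic picks the best of them), here is a minimal NumPy sketch. It is not taken from this repository: `wolpertinger_action`, `toy_actor`, `toy_critic`, and the brute-force nearest-neighbour search are hypothetical stand-ins, and a real setup would use trained TensorFlow networks and likely an approximate nearest-neighbour index.

```python
import numpy as np


def wolpertinger_action(state, actor, critic, actions, k=3):
    """Return the index of the chosen discrete action.

    actor(state) -> continuous proto-action of shape (d,)
    critic(state, action) -> scalar Q-value estimate
    actions: array of shape (n, d) holding every valid discrete action
    """
    proto = actor(state)
    # Brute-force k-nearest-neighbour lookup around the proto-action.
    dists = np.linalg.norm(actions - proto, axis=1)
    k = min(k, len(actions))
    nearest = np.argsort(dists)[:k]
    # Let the critic choose among the k candidates.
    q_values = np.array([critic(state, actions[i]) for i in nearest])
    return int(nearest[np.argmax(q_values)])


# Toy stand-ins for the trained networks (assumptions for illustration only).
def toy_actor(state):
    return np.array([0.3 * state[0]])


def toy_critic(state, action):
    return -float(action[0] ** 2)  # prefers actions close to 0


# Five 1-D actions: -1, -0.5, 0, 0.5, 1
actions = np.linspace(-1.0, 1.0, 5).reshape(-1, 1)
state = np.array([1.0])

greedy = wolpertinger_action(state, toy_actor, toy_critic, actions, k=1)
refined = wolpertinger_action(state, toy_actor, toy_critic, actions, k=2)
```

With `k=1` the result is simply the action nearest to the proto-action (0.5 here); with `k=2` the toy critic can override that choice and select 0 instead, which is the refinement step that a larger `k` buys at the cost of more critic evaluations.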
I used and extended stevenpjg's implementation of the DDPG algorithm, found here, which is licensed under the MIT license.
The master branch currently supports only continuous action spaces.
The discrete-and-continuous branch adds support for Gym's discrete environments.