Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym

License

NotificationsYou must be signed in to change notification settings

jimkon/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Link topaper

Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym.

This paper introduces Wolpertinger training algorithm that extends the Deep Deterministic Policy Gradient training algorithm introduced inthis paper.

I used and extendedstevenpjg's implementation ofDDPG algorithm foundhere licensed under the MIT license.

Master is currentlyonly for continuous action spaces.

The branch discrete-and-continuous provides the ability to use the discrete environments of the gym.

About

Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages


[8]ページ先頭

©2009-2025 Movatter.jp