Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

A Tensorflow implementation of the Option-Critic Architecture

NotificationsYou must be signed in to change notification settings

tdavchev/option-critic

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

(Bacon PL, Harb J, Precup D)

This is a TensorFlow implementation of the option critic architecture](https://arxiv.org/pdf/1609.05140.pdf) by Bacon PL, Harb J and Precup D.The release of the code was influenced by the recent Baseline initiative fromOpenAI and is aimed to serve as a starting point in Hierarchical RL, more specifically in options learning.

Providing a unified common ground through the adoption of a single framework will make it easier for the research community to replicate, refine, and identify new ideas, and will create good baselines to build research on top of as as recently pointed out by J. Schulman. This implementation of the option-critic architecture has been influenced by the original Theano implemntation byJ. Haarb as well as the extremely helpful repository ofD. Britz and the very engaging posts byA. Juliani.

We hope to be providing the community with a useful tensorflow implementation of a popular options learning architecture. This, however, is a research-oriented implementation thus it might not comply with the most efficient code practices or be fully functional across platforms.

You can download it by typing:

git clone git@github.com:yadrimz/option-critic.git

Requirements:

  • tensorflow-1.1.0
  • python 2.7
  • gym[atari]

Releases

No releases published

Packages

No packages published

Languages


[8]ページ先頭

©2009-2025 Movatter.jp