udacity/deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

www.udacity.com/course/deep-reinforcement-learning-nanodegree--nd893

License

MIT license

Folders and files

.github/workflows
cheatsheet
cross-entropy
ddpg-bipedal
ddpg-pendulum
discretization
dqn
dynamic-programming
finance
hill-climbing
lab-taxi
monte-carlo
p1_navigation
p2_continuous-control
p3_collab-compet
python
reinforce
temporal-difference
tile-coding
.gitignore
CODEOWNERS
LICENSE
README.md

Deep Reinforcement Learning Nanodegree

This repository contains material related to Udacity's Deep Reinforcement Learning Nanodegree program.

Tutorials

The tutorials lead you through implementing various algorithms in reinforcement learning. All of the code is in PyTorch (v0.4) and Python 3.

Dynamic Programming: Implement Dynamic Programming algorithms such as Policy Evaluation, Policy Improvement, Policy Iteration, and Value Iteration.
Monte Carlo: Implement Monte Carlo methods for prediction and control.
Temporal-Difference: Implement Temporal-Difference methods such as Sarsa, Q-Learning, and Expected Sarsa.
Discretization: Learn how to discretize continuous state spaces, and solve the Mountain Car environment.
Tile Coding: Implement a method for discretizing continuous state spaces that enables better generalization.
Deep Q-Network: Explore how to use a Deep Q-Network (DQN) to navigate a space vehicle without crashing.
Robotics: Use a C++ API to train reinforcement learning agents from virtual robotic simulation in 3D. (External link)
Hill Climbing: Use hill climbing with adaptive noise scaling to balance a pole on a moving cart.
Cross-Entropy Method: Use the cross-entropy method to train a car to navigate a steep hill.
REINFORCE: Learn how to use Monte Carlo Policy Gradients to solve a classic control task.
Proximal Policy Optimization: Explore how to use Proximal Policy Optimization (PPO) to solve a classic reinforcement learning task. (Coming soon!)
Deep Deterministic Policy Gradients: Explore how to use Deep Deterministic Policy Gradients (DDPG) with OpenAI Gym environments.
- Pendulum: Use OpenAI Gym's Pendulum environment.
- BipedalWalker: Use OpenAI Gym's BipedalWalker environment.
Finance: Train an agent to discover optimal trading strategies.
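
As an illustration of the Temporal-Difference tutorial above, the three methods it names (Sarsa, Q-Learning, and Expected Sarsa) differ only in the target used for the update. The sketch below is not taken from the repository's notebooks; the function names and the list-based representation of action values are assumptions made for this example.

```python
def sarsa_target(reward, gamma, q_next, next_action):
    """Sarsa: bootstrap from the action actually taken in the next state."""
    return reward + gamma * q_next[next_action]


def q_learning_target(reward, gamma, q_next):
    """Q-Learning (Sarsamax): bootstrap from the greedy action value."""
    return reward + gamma * max(q_next)


def expected_sarsa_target(reward, gamma, q_next, epsilon):
    """Expected Sarsa: bootstrap from the expectation under an
    epsilon-greedy policy over the next state's action values."""
    n = len(q_next)
    greedy = max(range(n), key=lambda a: q_next[a])
    probs = [epsilon / n] * n          # exploration mass, spread uniformly
    probs[greedy] += 1.0 - epsilon     # remaining mass on the greedy action
    return reward + gamma * sum(p * q for p, q in zip(probs, q_next))


def td_update(q_value, target, alpha):
    """Move the current estimate a fraction alpha toward the target."""
    return q_value + alpha * (target - q_value)


# Hypothetical numbers: two actions in the next state.
q_next = [1.0, 3.0]
new_q = td_update(0.0, q_learning_target(1.0, 0.5, q_next), alpha=0.1)
```

With the same reward and next-state values, Q-Learning gives the largest target, because it always assumes the greedy action will be taken next.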

Labs / Projects

The labs and projects can be found below. All of the projects use rich simulation environments from Unity ML-Agents. In the Deep Reinforcement Learning Nanodegree program, you will receive a review of your project. These reviews are meant to give you personalized feedback and to tell you what can be improved in your code.

The Taxi Problem: In this lab, you will train a taxi to pick up and drop off passengers.
Navigation: In the first project, you will train an agent to collect yellow bananas while avoiding blue bananas.
Continuous Control: In the second project, you will train a robotic arm to reach target locations.
Collaboration and Competition: In the third project, you will train a pair of agents to play tennis!

Resources

Cheatsheet: You are encouraged to use this PDF file to guide your study of reinforcement learning.

OpenAI Gym Benchmarks

Classic Control

Acrobot-v1 with Tile Coding and Q-Learning
Cartpole-v0 with Hill Climbing | solved in 13 episodes
Cartpole-v0 with REINFORCE | solved in 691 episodes
MountainCarContinuous-v0 with Cross-Entropy Method | solved in 47 iterations
MountainCar-v0 with Uniform-Grid Discretization and Q-Learning | solved in <50000 episodes
Pendulum-v0 with Deep Deterministic Policy Gradients (DDPG)
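
Uniform-grid discretization, as used for MountainCar-v0 above, maps each continuous state to a tuple of bin indices so that a tabular method such as Q-Learning can be applied. The following is a minimal sketch of the idea, not the repository's implementation; the helper names and the bounds in the usage lines are illustrative assumptions.

```python
from bisect import bisect_right


def create_uniform_grid(low, high, bins):
    """Return, for each dimension, the interior split points that divide
    [low, high] into `bins` equal-width cells."""
    grid = []
    for lo, hi, n in zip(low, high, bins):
        step = (hi - lo) / n
        grid.append([lo + step * i for i in range(1, n)])
    return grid


def discretize(sample, grid):
    """Map a continuous sample to a tuple of integer bin indices.
    Values beyond the outer bounds fall into the first or last bin."""
    return tuple(bisect_right(splits, x) for x, splits in zip(sample, grid))


# Hypothetical two-dimensional state space with 4 bins per dimension.
grid = create_uniform_grid(low=[-1.0, -5.0], high=[1.0, 5.0], bins=[4, 4])
cell = discretize([0.25, -3.0], grid)  # (2, 0)
```

The resulting tuple can be used directly as a dictionary key or array index for a Q-table.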

Box2d

BipedalWalker-v2 with Deep Deterministic Policy Gradients (DDPG)
CarRacing-v0 with Deep Q-Networks (DQN) | Coming soon!
LunarLander-v2 with Deep Q-Networks (DQN) | solved in 1504 episodes
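
DQN and DDPG agents are commonly trained on minibatches drawn from an experience replay buffer rather than on consecutive transitions. The sketch below shows one simple fixed-size buffer using only the standard library; the class and field names are assumptions for this example, and the repository's own agents may structure this differently.

```python
import random
from collections import deque, namedtuple

# One stored transition; the field names are illustrative.
Experience = namedtuple(
    "Experience", ["state", "action", "reward", "next_state", "done"]
)


class ReplayBuffer:
    """Fixed-size buffer that discards the oldest experience when full."""

    def __init__(self, capacity, seed=0):
        self.memory = deque(maxlen=capacity)
        self.rng = random.Random(seed)  # private RNG for reproducible sampling

    def add(self, state, action, reward, next_state, done):
        self.memory.append(Experience(state, action, reward, next_state, done))

    def sample(self, batch_size):
        """Draw a batch uniformly at random, without replacement."""
        return self.rng.sample(list(self.memory), batch_size)

    def __len__(self):
        return len(self.memory)


# Usage with placeholder integer states.
buffer = ReplayBuffer(capacity=3)
for t in range(5):
    buffer.add(state=t, action=0, reward=1.0, next_state=t + 1, done=False)
batch = buffer.sample(2)
```

Sampling uniformly from past experience breaks the correlation between consecutive transitions, which is the usual motivation for the technique.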

Toy Text

FrozenLake-v0 with Dynamic Programming
Blackjack-v0 with Monte Carlo Methods
CliffWalking-v0 with Temporal-Difference Methods
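
Value Iteration, one of the Dynamic Programming algorithms covered for FrozenLake-v0, can be sketched on a tiny hand-written MDP. This is an illustrative sketch rather than the notebook's solution: the three-state MDP is hypothetical, and the transition format, a list of (probability, next state, reward, done) tuples per state and action, is chosen here to resemble the table that Gym's toy-text environments expose.

```python
def value_iteration(transitions, gamma=0.9, theta=1e-8):
    """Return the optimal state values of a small, fully known MDP.

    transitions[s][a] is a list of (prob, next_state, reward, done) tuples.
    States with no actions are treated as terminal and keep value 0.
    """
    V = {s: 0.0 for s in transitions}
    while True:
        delta = 0.0
        for s, actions in transitions.items():
            if not actions:
                continue
            # Bellman optimality backup: best expected one-step return.
            best = max(
                sum(p * (r + gamma * V[s2] * (not done))
                    for p, s2, r, done in outcomes)
                for outcomes in actions.values()
            )
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < theta:
            return V


# Hypothetical chain: 0 -> 1 -> 2 (terminal), reward 1.0 on reaching state 2.
mdp = {
    0: {"stay": [(1.0, 0, 0.0, False)], "right": [(1.0, 1, 0.0, False)]},
    1: {"left": [(1.0, 0, 0.0, False)], "right": [(1.0, 2, 1.0, True)]},
    2: {},
}
values = value_iteration(mdp)  # V[1] = 1.0, V[0] = gamma * V[1] = 0.9
```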

Dependencies

To set up your Python environment to run the code in this repository, follow the instructions below.

Create (and activate) a new environment with Python 3.6.

Linux or Mac:

conda create --name drlnd python=3.6
source activate drlnd

Windows:

conda create --name drlnd python=3.6
activate drlnd

If running in Windows, ensure you have the "Build Tools for Visual Studio 2019" installed from this site. This article may also be very helpful. This was confirmed to work in Windows 10 Home.
Follow the instructions in this repository to perform a minimal install of OpenAI gym.
- Next, install the classic control environment group by following the instructions here.
- Then, install the box2d environment group by following the instructions here.

Clone the repository (if you haven't already!), and navigate to the python/ folder. Then, install several dependencies.

git clone https://github.com/udacity/deep-reinforcement-learning.git
cd deep-reinforcement-learning/python
pip install .

Create an IPython kernel for the drlnd environment.

python -m ipykernel install --user --name drlnd --display-name "drlnd"

Before running code in a notebook, change the kernel to match the drlnd environment by using the drop-down Kernel menu.

Want to learn more?

Come learn with us in the Deep Reinforcement Learning Nanodegree program at Udacity!
