RL_Cartpole
Implementation of the Q-learning and SARSA algorithms to solve the CartPole-v1 environment. [Advanced Machine Learning project - UniGe]
This project implements the Q-learning and SARSA algorithms to solve the CartPole-v1 environment from OpenAI Gym. The Q-learning algorithm learns an optimal action-value function, while the SARSA algorithm learns an action-value function based on the current policy. The goal is to balance a pole on a cart by applying appropriate forces.
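The core difference between the two algorithms is the bootstrap target of the temporal-difference update. Below is a minimal sketch of the two update rules, assuming a tabular Q indexed by a discretized state and an action; the variable names are illustrative and not taken from this repository's code.

```python
import numpy as np

# Illustrative tabular updates (not the repository's exact code).
# Q is a NumPy array indexed as Q[state][action] for a discretized state.

def q_learning_update(Q, s, a, r, s_next, alpha, gamma):
    # Off-policy: bootstrap from the greedy (max-value) action in the next state.
    td_target = r + gamma * np.max(Q[s_next])
    Q[s][a] += alpha * (td_target - Q[s][a])

def sarsa_update(Q, s, a, r, s_next, a_next, alpha, gamma):
    # On-policy: bootstrap from the action actually taken by the behavior policy.
    td_target = r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (td_target - Q[s][a])
```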
- Clone the repository or download the source code files.
git clone git@github.com:ErfanFathi/RL_Cartpole.git
- Install the required packages.
pip3 install -r requirements.txt
- Run the script with the desired parameters. Use the following command to see the available options:
python3 main.py --help
This script uses command-line arguments to configure the learning parameters and other settings. You can specify the following options:
- --algorithm: The algorithm to use for learning. Valid options are q_learning and sarsa. Default is q_learning.
- --alpha: The learning rate. Default is 0.1.
- --gamma: The discount factor. Default is 0.995.
- --epsilon: The probability of choosing a random action. Default is 0.1.
- --num_episodes: The number of episodes to run. Default is 1000.
- --num_steps: The maximum number of steps per episode. Default is 500.
- --num_bins: The number of bins to use for discretizing the state space. Default is 20 (see the discretization sketch after the example below).
For example:
python3 main.py --algorithm q_learning --alpha 0.2 --gamma 0.99 --num_episodes 2000
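The --num_bins option controls how CartPole's continuous observation (cart position, cart velocity, pole angle, pole angular velocity) is mapped to a discrete state that a tabular method can index. Below is a minimal sketch of one common way to do this; the clipping bounds and the helper name discretize are illustrative assumptions, not necessarily what main.py implements.

```python
import numpy as np

# Assumed clipping bounds per observation dimension (illustrative values only).
STATE_BOUNDS = [(-4.8, 4.8), (-3.0, 3.0), (-0.418, 0.418), (-3.5, 3.5)]

def discretize(observation, num_bins=20):
    """Map a continuous CartPole observation to a tuple of bin indices."""
    indices = []
    for value, (low, high) in zip(observation, STATE_BOUNDS):
        clipped = np.clip(value, low, high)
        # Scale into [0, num_bins - 1] and truncate to an integer bin index.
        idx = int((clipped - low) / (high - low) * (num_bins - 1))
        indices.append(idx)
    return tuple(indices)
```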
- The script will execute the chosen algorithm on the CartPole-v1 environment. It will print the name of the generated file containing the results.
- After the execution, a plot of the rewards obtained during the learning process will be saved in the plots directory as a PNG file.
- Additionally, frames of the agent's behavior will be rendered and saved as a GIF file in the videos directory. This provides a visual representation of the learned policy (one way to do this is sketched below).
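For reference, a rollout of the learned policy can be rendered and written to a GIF roughly as follows. This is a hedged sketch assuming the gym >= 0.26 API (render_mode="rgb_array", five-value step) and the imageio package; Q and discretize stand in for the trained table and the discretization helper from the sketch above and are not the repository's actual names.

```python
import os
import gym
import imageio
import numpy as np

# Reuses the discretize() sketch above; Q stands in for the trained table
# (zero-initialized here only so the snippet runs on its own).
env = gym.make("CartPole-v1", render_mode="rgb_array")
num_bins = 20
Q = np.zeros((num_bins,) * env.observation_space.shape[0] + (env.action_space.n,))

obs, _ = env.reset()
frames, done = [], False
while not done:
    frames.append(env.render())                             # RGB frame (NumPy array)
    action = int(np.argmax(Q[discretize(obs, num_bins)]))   # greedy w.r.t. the table
    obs, reward, terminated, truncated, _ = env.step(action)
    done = terminated or truncated
env.close()

os.makedirs("videos", exist_ok=True)
imageio.mimsave("videos/cartpole.gif", frames)               # write frames to a GIF
```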
Feel free to use and modify this code, and please feel free to fork it on GitHub and send pull requests.
Report any comments or bugs to:
fathierfan97@gmail.com
Regards,
Erfan Fathi