Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
CartPoleLearningSystem.m		CartPoleLearningSystem.m
Final Paper_RL on Control1570364775.pdf		Final Paper_RL on Control1570364775.pdf
Final year project report final draft.pdf		Final year project report final draft.pdf
JacksCarRentalProblem.m		JacksCarRentalProblem.m
QLearningCartPole.m		QLearningCartPole.m
QLearningCartPoleLeastTrials.m		QLearningCartPoleLeastTrials.m
QLearningCartPoleThetaCheck.m		QLearningCartPoleThetaCheck.m
QSwingUp.m		QSwingUp.m
README.md		README.md
SarsaLearningCartPole.m		SarsaLearningCartPole.m
SarsaLearningCartPoleLeastTrials.m		SarsaLearningCartPoleLeastTrials.m
SarsaLearningCartPoleSwingUp.m		SarsaLearningCartPoleSwingUp.m
SarsaLearningCartPoleThetaCheck.m		SarsaLearningCartPoleThetaCheck.m
SwingUpController.m		SwingUpController.m
SwingUpController2.m		SwingUpController2.m
cartPole.m		cartPole.m
cart_pole.m		cart_pole.m
cart_pole2.m		cart_pole2.m
cmpt_P_and_R.m		cmpt_P_and_R.m
getBox.m		getBox.m
getBox2.m		getBox2.m
getBox3.m		getBox3.m
getBox4.m		getBox4.m
getBox5.m		getBox5.m
getBox6.m		getBox6.m
linfun1.m		linfun1.m
linfun1SwingUp.m		linfun1SwingUp.m
probPushRight.m		probPushRight.m
takeAction2.m		takeAction2.m

Repository files navigation

Reinforcement-learning-Algorithms-and-Dynamic-Programming

Reinforcement learning Algorithms such as SARSA, Q learning, Actor-Critic Policy Gradient and Value Function Approximation were applied to stabilize an inverted pendulum system and achieve optimal control. So essentially, the concept of Reinforcement Learning Controllers has been established. The Reinforcement Learning Controllers have been compared on the basis of performance and efficiency and they are separately compared with the classical Linear Quadratic Regulator Controller. Each of the RL controller have been integrated with a Swing up controller. A virtual switch toggles between the Swing up controller and the RL controller automatically, based on the value of the angular deviation theta with respect to the vertical plane. My research paper and my undergraduate thesis have been uploaded for reference. All the codes have also been uploaded.
Dynamic Programming was applied to Jack's Car Rental Problem to maximize the amount Jack gets by the end of the day, with some Constraints.

About

Releases1

v1.0.0 Latest

May 26, 2022

Packages

No packages published

Languages

MATLAB100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Reinforcement-learning-Algorithms-and-Dynamic-Programming

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases1

Packages

Languages

Movatterモバイル変換

savinay95n/Reinforcement-learning-Algorithms-and-Dynamic-Programming

Folders and files

Latest commit

History

Repository files navigation

Reinforcement-learning-Algorithms-and-Dynamic-Programming

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases1

Packages0

Languages

Packages