SCPO: Safe Reinforcement Learning with Safety Critic Policy Optimization


SafeRL-Lab/SCPO


SCPO is a safe reinforcement learning algorithm. This repo is a fork of Stable-Baselines3.
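The core idea of a safety-critic method is that a critic predicting constraint cost gates each reward-improving policy update. The sketch below is a toy illustration of that gating in plain Python (the reward/cost functions and the recovery step are made up for the example; this is not the algorithm implemented in this repo):

```python
def cost(theta):
    # Toy "safety critic": predicted constraint cost of parameter theta.
    # Cost is zero below theta = 1 and grows linearly past it.
    return max(0.0, theta - 1.0)

def cost_grad(theta):
    # Gradient of the toy cost above.
    return 1.0 if theta > 1.0 else 0.0

def safe_step(theta, reward_grad, cost_limit, lr=0.2):
    candidate = theta + lr * reward_grad   # reward-improving step
    if cost(candidate) <= cost_limit:      # critic predicts it stays safe
        return candidate                   # -> accept the update
    return theta - lr * cost_grad(theta)   # else take a cost-reducing step

theta = 0.0
for _ in range(30):
    theta = safe_step(theta, reward_grad=1.0, cost_limit=0.5)
# theta climbs toward higher reward but stays inside the cost budget
```

The update improves reward freely while the critic predicts the cost budget is respected, and falls back to reducing cost otherwise, so the iterate hovers at the constraint boundary instead of crossing it.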

Installation

Note: Stable-Baselines3 supports PyTorch >= 1.11

Prerequisites

SCPO requires Python 3.7+.

Install using pip

Install the required packages:

pip install -r requirements.txt

We use environments from Bullet-Safety-Gym. Please follow the installation steps from https://github.com/SvenGronauer/Bullet-Safety-Gym.

If you want to run PyTorch in GPU mode, please install CUDA and PyTorch separately; see https://pytorch.org/.
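Putting the steps above together, a full setup might look like the following (the clone URLs and the editable install of Bullet-Safety-Gym are assumptions based on the respective repositories, not commands stated in this README):

```shell
# Clone SCPO (a Stable-Baselines3 fork) and install its requirements
git clone https://github.com/SafeRL-Lab/SCPO.git
cd SCPO
pip install -r requirements.txt

# Install Bullet-Safety-Gym for the benchmark environments
git clone https://github.com/SvenGronauer/Bullet-Safety-Gym.git
pip install -e Bullet-Safety-Gym
```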

Training

Example code for training can be found in train.py. To train models with the best hyperparameters, please check train_best_hyper.py.

Running the environment

Check play.py.


Benchmark

[Benchmark result plots]

Citation

If you find the repository useful, please cite the study:

@article{mhamed2023scpo,
  title={SCPO: Safe Reinforcement Learning with Safety Critic Policy Optimization},
  author={Mhamed, Jaafar and Gu, Shangding},
  journal={arXiv preprint arXiv:2311.00880},
  year={2023}
}
