lagom
A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
lagom is a 'magic' word in Swedish: *inte för mycket och inte för lite, enkelhet är bäst* (not too much and not too little, simplicity is often the best). It is the philosophy on which this library was designed.
lagom balances flexibility and usability when developing reinforcement learning (RL) algorithms. The library is built on top of PyTorch and provides modular tools to quickly prototype RL algorithms. However, it does not go overboard: an overly low-level toolkit is time consuming and prone to bugs, while an overly high-level one sacrifices the flexibility needed to try out crazy ideas quickly.
We are continuously making lagom more 'self-contained' so that experiments can be set up and run quickly. It provides base classes for multiprocessing (a master-worker framework) to parallelize, for example, experiments and evolution strategies. It also supports hyperparameter search with configurations defined as either grid search or random search.
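For illustration, a grid/random-search configuration could look like the minimal sketch below. The `lagom.experiment` import path and the `Config`/`Grid`/`Sample` names are assumptions for this sketch, not a confirmed API, so check the documentation for the exact names:

```python
# Minimal sketch of a hyperparameter search configuration.
# NOTE: Config/Grid/Sample under lagom.experiment are assumed names for
# illustration; verify against the lagom documentation.
import numpy as np
from lagom.experiment import Config, Grid, Sample

config = Config(
    {'log.freq': 10,                                   # fixed hyperparameter
     'env.id': Grid(['CartPole-v1', 'Pendulum-v0']),   # grid search over environments
     'agent.lr': Sample(lambda: 10**np.random.uniform(-4, -2)),  # random search over learning rates
     'train.timestep': int(1e5)}                       # fixed training budget
)
```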
We highly recommend using a Miniconda environment:

conda create -n lagom python=3.7
conda activate lagom
pip install -r requirements.txt
We also provide bash scripts in the scripts/ directory to automatically set up the system configuration, conda environment and dependencies.
git clone https://github.com/zuoxingdong/lagom.git
cd lagom
pip install -e .

Installing from source allows you to flexibly modify and adapt the code as you please, which is very convenient for research purposes.
The documentation, hosted by ReadTheDocs, is available online at http://lagom.readthedocs.io
We have implemented a collection of standard reinforcement learning algorithms using lagom in baselines.
A common pipeline for using lagom is as follows:
- Define your RL agent
- Define your environment
- Define your engine for training and evaluating the agent in the environment
- Define your configurations for hyperparameter search
- Define `run(config, seed, device)` for your experiment pipeline
- Call `run_experiment(run, config, seeds, num_worker)` to parallelize your experiments (see the sketch below)
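The snippet below sketches this pipeline end to end, built around the `run(config, seed, device)` and `run_experiment(run, config, seeds, num_worker)` signatures listed above. The `lagom.experiment` import path and the `Config`/`Grid` helpers are assumptions, and the `make_*` factories are hypothetical placeholders for your own environment, agent and engine code:

```python
# Minimal sketch of the pipeline above. The import path and Config/Grid are
# assumed; make_env/make_agent/make_engine are hypothetical placeholders for
# your own environment, agent and engine definitions.
from lagom.experiment import Config, Grid, run_experiment

config = Config(
    {'env.id': Grid(['CartPole-v1', 'Pendulum-v0']),  # hyperparameter grid
     'agent.lr': 1e-3}
)

def run(config, seed, device):
    # One call handles a single (configuration, seed) pair on the given device.
    env = make_env(config['env.id'], seed)       # hypothetical helper
    agent = make_agent(config, env, device)      # hypothetical helper
    engine = make_engine(config, agent, env)     # hypothetical helper
    engine.train()                               # train the agent
    return engine.eval()                         # return evaluation logs

if __name__ == '__main__':
    # Parallelize all (configuration, seed) combinations over 4 worker processes.
    run_experiment(run, config, seeds=[1, 2, 3], num_worker=4)
```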
A graphical illustration is coming soon.
We provide a few simple examples.
We use pytest for tests. Feel free to run it via:

pytest test -v
2019-03-04 (v0.0.3)
- Much easier and cleaner APIs
2018-11-04 (v0.0.2)
- More high-level API designs
- More unit tests
2018-09-20 (v0.0.1)
- Initial release
This repo is inspired by OpenAI Gym, OpenAI baselines, and OpenAI Spinning Up.
Please use this BibTeX entry if you want to cite this repository in your publications:
@misc{lagom,
  author = {Zuo, Xingdong},
  title = {lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms},
  year = {2018},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/zuoxingdong/lagom}},
}