RelayRL-prototype

Unstable Single-Agent Distributed Reinforcement Learning Framework written in Rust & exported to Python

License: Apache License 2.0

Warning:
This project is a prototype and is unstable during training. Use at your own risk and expect breaking changes.

Platform Support: RelayRL runs on MacOS, Linux, and Windows (x86_64). Some dependencies may require additional setup on Windows.

RelayRL is a high-performance, distributed reinforcement learning (RL) framework designed for research and experimentation. It integrates a client-server architecture over proven transport-layer communication protocols, and provides a robust Rust backend with Python bindings, enabling seamless integration with modern ML workflows and high-speed system-level orchestration.
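
In practice, the split looks like one process hosting the training server and another hosting the agent that talks to your environment. Below is a minimal sketch of that layout, assuming the RelayRLAgent and TrainingServer names mentioned in the roadmap section; the constructor arguments shown here are hypothetical, and the notebooks in examples/ are the authoritative reference.

    # Hypothetical two-process layout; names and arguments are assumptions.
    from multiprocessing import Process

    def run_server():
        # Hosts the PyTorch training loop on one side of the transport.
        from relayrl_framework import TrainingServer  # name from the roadmap
        server = TrainingServer(algorithm="PPO", transport="zmq")  # hypothetical args
        server.serve()

    def run_agent():
        # Connects to the server and serves actions to the environment.
        from relayrl_framework import RelayRLAgent  # name from the roadmap
        agent = RelayRLAgent(transport="zmq")  # hypothetical args
        # ... environment interaction loop goes here ...

    if __name__ == "__main__":
        Process(target=run_server).start()
        Process(target=run_agent).start()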

Single-Agent Focus

RelayRL is primarily designed for single-agent RL.
While it is theoretically possible to launch multiple agent and training server processes for multi-agent scenarios, the framework does not natively support scalable multi-agent orchestration at this time.

Features

  • Distributed RL Orchestration: Run agents and training servers as separate processes, communicating via gRPC or ZeroMQ.
  • Python & Rust Interoperability: Write RL algorithms in Python with PyTorch (built-in or custom), while Rust handles orchestration and performance-critical components.
  • Extensible Algorithm/Environment Support: Plug in your own RL algorithms/environments in Python (see examples/ for Jupyter notebooks).
  • Benchmarking & Profiling: Tools and scripts for measuring network and runtime performance.
  • Configurable Experiments: JSON-based configuration for easy experiment management (see the sketch after this list).
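
For illustration, an experiment config could be a plain JSON file written and read from Python. This is a hypothetical sketch: the key names below are invented for this example and are not the framework's actual schema.

    # Hypothetical experiment config; key names are illustrative only.
    import json

    config = {
        "algorithm": "PPO",
        "transport": "zmq",  # or "grpc"
        "server_address": "tcp://127.0.0.1:5555",
        "training": {"epochs": 50, "batch_size": 64},
    }

    with open("experiment.json", "w") as f:
        json.dump(config, f, indent=2)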

Quick Start

Install RelayRL Framework:

pip install relayrl_framework
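
A quick sanity check after installing, assuming the importable module shares the PyPI package name:

    # The import should succeed after `pip install relayrl_framework`.
    import relayrl_framework
    print(relayrl_framework.__name__)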

Build From Source

Supported Platforms: MacOS, Linux, Windows (x86_64)

  1. Clone the repository:

    git clone https://github.com/jrcalgo/RelayRL-prototype.git
    cd RelayRL-prototype

  2. Build the Rust framework:

    cd relayrl_framework
    cargo build --release

  3. Install Python bindings (from the relayrl_framework directory):

    pip install torch==2.5.1
    pip install maturin
    maturin develop --release
  4. Run an example (see examples/):

    • Jupyter notebooks demonstrate how to use the Python API for training and evaluation (a hypothetical outline follows below).
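
In outline, a single-agent loop might look like the following. This is only a hypothetical sketch, assuming a gymnasium-style environment and inventing the agent's method names; the notebooks in examples/ are the authoritative reference.

    # Hypothetical single-agent loop; method names are assumptions.
    import gymnasium as gym
    from relayrl_framework import RelayRLAgent  # name from the roadmap section

    env = gym.make("CartPole-v1")
    agent = RelayRLAgent(transport="zmq")  # hypothetical constructor

    obs, _ = env.reset()
    for _ in range(1000):
        action = agent.act(obs)  # hypothetical: request an action
        obs, reward, terminated, truncated, _ = env.step(action)
        agent.observe(reward, terminated)  # hypothetical: report feedback
        if terminated or truncated:
            obs, _ = env.reset()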

Tutorial

See the examples README (examples/README.md)!

Directory Structure

  • relayrl_framework/ — Core Rust framework and Python bindings
  • examples/ — Example scripts and Jupyter notebooks (Python API usage)
  • README.md — This file (project overview)
  • CONTRIBUTING.md — OSS Contribution instructions for relayrl_framework crate
  • relayrl_framework/README.md — Framework-level and development instructions
  • examples/README.md — Python integration and API utilization instructions

Repository Feature Roadmap

  • Improved training stability and error handling for gRPC and ZeroMQ implementations
  • Cleaned up user-level APIs (RelayRLAgent, TrainingServer, ConfigLoader, etc.)
  • Better documentation and API reference
  • Benchmarking against known frameworks (Ray RLlib, Stable-Baselines3, CleanRL, etc.)
  • Preservation of most critical components/logic

Related Work (Currently in-progress in a separate repository)

  • Improved multi-agent support and orchestration by enhancing component modularity
  • Enhanced stability and scalability during training through comprehensive network error handling, backpressure, configurable components, etc.
  • Real-time configuration updates and shutdown/init signals through lifecycle control
  • Exposing scalable components for containerized automation
  • Easing execution and external application integration through compilation into CLI executables
  • More built-in RL algorithms available

Disclaimer

This project is a prototype and is unstable during training. As of now, both transport loops are functional, but ZeroMQ has proven more stable than gRPC in training tasks. APIs and features are subject to change.
Contributions and feedback are welcome!

Contributions

See the contribution instructions (CONTRIBUTING.md)!

License

Apache License 2.0
