Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

[RLLib-contrib] Implementation of RED-Q (Ensemble SAC) Algorithm in PyTorch#39747

Open
aivarsoo wants to merge2 commits intoray-project:master
base:master
Choose a base branch
Loading
fromaivarsoo:redq-pytorch

Conversation

aivarsoo
Copy link

@aivarsooaivarsoo commentedSep 19, 2023
edited
Loading

Why are these changes needed?

Implementation of theRED-Q algorithm (SAC with an ensemble of Q functions) in PyTorch. The implementation is based on the existing SAC implementation adding the ensemble functionality to model and policy classes.

Checks

  • I've signed off every commit(by using the -s flag, i.e.,git commit -s) in this PR.
  • I've runscripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed forhttps://docs.ray.io/en/master/.
    • I've added any new APIs to the API Reference. For example, if I added a
      method in Tune, I've added it indoc/source/tune/api/ under the
      corresponding.rst file.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures athttps://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

Signed-off-by: Aivar Sootla <aivar.sootla@gmail.com>
Signed-off-by: Aivar Sootla <aivar.sootla@gmail.com>
@anyscalesamanyscalesam added triageNeeds triage (eg: priority, bug/not-bug, and owning component) rllibRLlib related issues labelsMay 15, 2024
@hainesmichaelchainesmichaelc added the community-contributionContributed by the community labelApr 4, 2025
Sign up for freeto join this conversation on GitHub. Already have an account?Sign in to comment
Reviewers
No reviews
Assignees
No one assigned
Labels
community-contributionContributed by the communityrllibRLlib related issuestriageNeeds triage (eg: priority, bug/not-bug, and owning component)
Projects
None yet
Milestone
No milestone
Development

Successfully merging this pull request may close these issues.

3 participants
@aivarsoo@hainesmichaelc@anyscalesam

[8]ページ先頭

©2009-2025 Movatter.jp