
Mark Towers (pseudo-rnd-thoughts)

PhD Student at the University of Southampton exploring Explainable Reinforcement Learning

    Organizations

    @Farama-Foundation


    A PhD Student exploring Explainable Reinforcement Learning and project manager of Gymnasium and Gym.

    • ☀️ By day at the University of Southampton, I explore how to understand and explain the decision-making of reinforcement learning agents, in particular an agent's goals and future aims. This work is carried out within the MINDS CDT with sponsorship from the Royal Bank of Canada.
    • 🌙 By night (and often during the day), I am the project manager of Gymnasium and Gym, the de facto Reinforcement Learning environment APIs. I do this as a member of the Farama Foundation; you can read more about it here.

    Pinned

    1. Farama-Foundation/Gymnasium (Public)

      An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

      Python · 8.6k stars · 958 forks

    2. temporal-reward-decomposition (Public)

      Implementation of "Explaining an Agent’s Future Beliefs through Temporally Decomposing Future Reward Estimators"

      Python · 3 stars

    3. temporal-explanations-4-drl (Public)

      Implementation of "Temporal Explanations for Explainable Reinforcement Learning"

      Jupyter Notebook

