Skip to content
View pseudo-rnd-thoughts's full-sized avatar

Organizations

@Farama-Foundation

Block or report pseudo-rnd-thoughts

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Welcome traveller, I'm Mark

A PhD Student exploring Explainable Reinforcement Learning and project manager of Gymnasium and Gym.

  • ☀️ By day at the University of Southampton, I explore how to understand and explain the decision making of reinforcement learning agent, in particular, the goals and future aims of an agent. This work is completed within the MINDS CDT with a sponsorship from the Royal Bank of Canada.
  • 🌙 By night (and often during the day), I am the project manager of Gymnasium and Gym, the de facto Reinforcement Learning environment APIs. This is as I am member of the Farama Foundation, you can read more about it here

Pinned Loading

  1. Farama-Foundation/Gymnasium Farama-Foundation/Gymnasium Public

    An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

    Python 7.7k 870

  2. temporal-reward-decomposition temporal-reward-decomposition Public

    Implementation of "Explaining an Agent’s Future Beliefs through Temporally Decomposing Future Reward Estimators"

    Python 2

  3. temporal-explanations-4-drl temporal-explanations-4-drl Public

    Implementation of "Temporal Explanations for Explainable Reinforcement Learning"

    Jupyter Notebook