Develop sad crossplay #1

Open
wants to merge 47 commits into
base: develop-SAD

Danielhp95
Collaborator

Finished implementing the cross-play matrix computation as defined in "'Other-Play' for Zero-Shot Coordination" (Hu et al.).
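
For readers unfamiliar with the term, here is a minimal sketch of what a cross-play matrix looks like, assuming a two-player cooperative game. It is not the code in this PR: `cross_play_matrix`, `summarize`, `play_episode`, and the toy agents are hypothetical names used only for illustration. Entry (i, j) is the mean return when agent i is paired with agent j; the diagonal is self-play and the off-diagonal entries are cross-play.

```python
from statistics import mean
from typing import Callable, List, Sequence, Tuple, TypeVar

Agent = TypeVar("Agent")


def cross_play_matrix(
    agents: Sequence[Agent],
    play_episode: Callable[[Agent, Agent], float],  # hypothetical evaluation hook
    episodes_per_pair: int = 10,
) -> List[List[float]]:
    """Return an n x n matrix of mean returns for every ordered pair of agents."""
    n = len(agents)
    matrix = [[0.0] * n for _ in range(n)]
    for i, first in enumerate(agents):
        for j, second in enumerate(agents):
            returns = [play_episode(first, second) for _ in range(episodes_per_pair)]
            matrix[i][j] = mean(returns)
    return matrix


def summarize(matrix: Sequence[Sequence[float]]) -> Tuple[float, float]:
    """Return (mean self-play score, mean cross-play score)."""
    n = len(matrix)
    if n < 2:
        raise ValueError("cross-play needs at least two agents")
    self_play = mean(matrix[i][i] for i in range(n))
    cross_play = mean(
        matrix[i][j] for i in range(n) for j in range(n) if i != j
    )
    return self_play, cross_play


# Toy stand-in for trained agents: each "agent" is just the id of the
# convention it learned, and a pair scores 1.0 only if conventions match.
def toy_episode(first: int, second: int) -> float:
    return 1.0 if first == second else 0.0


toy_agents = [0, 1, 0]  # three independently "trained" agents
toy_matrix = cross_play_matrix(toy_agents, toy_episode, episodes_per_pair=3)
toy_self_play, toy_cross_play = summarize(toy_matrix)
```

In the toy case every agent plays perfectly with itself, while most mismatched pairs fail. A high self-play score paired with a low cross-play score is the gap this kind of matrix is meant to expose.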

Near32 and others added 30 commits February 24, 2021 15:20
+ updated ActorCriticNet and derived classes to rnn/frame_state standards.
+ updated trajectory gif plotter with celluloid.
+ added a parameterized render fn to VecEnv.
+ added residual/skip connections to some network bodies to bypass RNNs.
…rder to make sure it won't fail silently, just in case...
…entialGym.

TODO: implement and test metrics.
TODO: parallelised rl loop and supervised learning loop test.
IN PROGRESS: debugging of multi-step CIC metric: cannot differentiate between comm. and action-only rule-based agents...
+ needed to add a query method to agents in order to deal with computing losses using agents outside of agents' methods.
+ update marl_loop to new dict-outputting vec-env [TODO: test further...]
…a sequence, as done by R2D2 (but not DQN...).
…R... (no sequence of experience to handle during loss computation...)

+ updating final HER strategy in HER wrapper v2.
Near32 added 17 commits July 17, 2021 12:40
…agents.

+ all comaze metric modules have a deterministic trigger.
+ implemented OP fix attempt.
TODO: debug...
…dules biasing the main agents.

+ implemented some biasing modules for S2B.
…y losses modules.

+ added a multi reconstruction module.
+ fixed issue with test_agent fn's returning total return in place of total int return.
3 participants