Develop sad crossplay #1

Danielhp95 · 2021-03-05T15:41:52Z

Finished coding cross-play matrix computation as defined in "Other-Play" for Zero-Shot Coordination, Hu et al

…ossplay

performance metrics from test episodes

episode length or mean returns...)

+ updated ActorCriticNet and derived classes to rnn/frame_state standards. + updated trajectory gif plotter with celluloid. + added a parameterized render fn to VecEnv. + added residual/skip connections to some network bodies to bypass RNNs.

…rder to make sure it won't fail silently, just in case...

…entialGym. TODO: implement and test metrics. TODO: parallelised rl loop and supervised learning loop test.

IN PROGRESS: debugging of multi-step CIC metric: cannot differentiate between comm. and action-only rule-based agents...

+ needed to add a query method to agents in order to deal with computing losses using agents outside of agents' methods.

+ update marl_loop to new dict-outputting vec-env [TODO: test further...]

…xtra inputs.

…a sequence, as done by R2D2 (but not DQN...).

…R... (no sequence of experience to handle during loss computation...) + updating final HER strategy in HER wrapper v2.

…Maze benchmark.

…agents. + all comaze metric module have a deterministic trigger.

+ implemented OP fix attempt. TODO: debug...

…dules biasing the main agents. + implemented some biasing modules for S2B.

…y losses modules. + added a multi reconstruction module.

+ fixed issue with test_agent fn's returning total return in place of total int return.

… fn.

Near32 and others added 30 commits February 24, 2021 15:20

adding benchmark script for CoMaze

32686a9

Merge remote-tracking branch 'origin/develop-SAD' into develop-SAD-cr…

4959bc1

…ossplay

Added feature in test_agent to return a dictionary containing

2034d55

performance metrics from test episodes

Added random agent

cda36ed

Forgot to add to previous commit!

6ba9442

Added feature in fn test_agent to request performance metrics (like

910df1e

episode length or mean returns...)

Added code to compute cross-play matrices!

52523cc

update ther agent in progress...

59c8dbc

solved an issue with ext pos return of comaze. left an ipdb call in o…

7fe13a1

…rder to make sure it won't fail silently, just in case...

commented out some logs...

1e3687d

implemented pubsub pattern with rl loop and started integrating Refer…

23cb555

…entialGym. TODO: implement and test metrics. TODO: parallelised rl loop and supervised learning loop test.

adding support for rule-based agent in CoMaze.

df18a0f

implemented multi-step CIC metric and optimization and logger modules.

219e733

IN PROGRESS: debugging of multi-step CIC metric: cannot differentiate between comm. and action-only rule-based agents...

adding run script for CoMaze debugging.

d109781

added positive signalling metric module to CoMaze benchmark.

f30b128

adding comaze goal prediction.

5c7823b

+ needed to add a query method to agents in order to deal with computing losses using agents outside of agents' methods.

adding comaze-gym into envs...

9ead368

adding missing elements for comaze auxiliary task module...

dc71c08

update CoMaze config for OP concerns and train/test requirements.

19fa1ed

+ update marl_loop to new dict-outputting vec-env [TODO: test further...]

Merge branch 'develop' into develop-update-ther

df9a3e5

updated HER with R2D2 and goal integration is done seamlessly using e…

6fa6b14

…xtra inputs.

updating HER algo wrapper to take into account experience storing as …

267dab6

…a sequence, as done by R2D2 (but not DQN...).

adding DQN HER as an update to use DQN's lightweight training with HE…

e3d7100

…R... (no sequence of experience to handle during loss computation...) + updating final HER strategy in HER wrapper v2.

update comaze benchmark ...

e9e78c1

adding THER2 files...

2a1681d

adding testing scripts for CoMaze benchmark.

4eea8d1

adding recording expection for CSGPU2 in CoMaze benchmark script.

17170d5

Merge branch 'develop' of https://github.com/Near31/Regym into develop

5266752

updated the progress bar to fit to the use of reload agents...

792c4df

Near32 added 17 commits July 17, 2021 12:40

Merge branch 'develop' of https://github.com/Near32/Regym into develop

8a36594

making exception for recording on test script, in CSGPU2.

d6a5159

bug fix with the way rl agent's hidden states is being computed in Co…

d8a3dd7

…Maze benchmark.

Merge branch 'develop' of https://github.com/Near32/Regym into develop

530e4bc

implemented augmented way of probing goal ordering prediction for RL …

f294bd4

…agents. + all comaze metric module have a deterministic trigger.

implemented marl environment module.

0d1ff12

+ implemented OP fix attempt. TODO: debug...

debugged marl environment module and OP fix.

4eb250a

adding env-config.

e6637d1

Merge branch 'develop' of https://github.com/Near32/Regym into develop

5eabb3d

adding S2B benchmark.

485bfb6

update agent architecture to debug backpropagation issue with some mo…

fa6437a

…dules biasing the main agents. + implemented some biasing modules for S2B.

adding missing files of modules biasing main agents.

3a839f7

fixed issue with backpropagation through agent's core of the auxiliar…

ee10705

…y losses modules. + added a multi reconstruction module.

adding posdis rule based agents to S2B.

617fc38

simplify setup.py and made PyTorch==1.8.1 mandatory.

f94e8c0

merge-update with develop.

0439e6f

+ fixed issue with test_agent fn's returning total return in place of total int return.

updated cross_play infrastructure. TODO: full testing of the plotting…

252d81f

… fn.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Develop sad crossplay #1

Develop sad crossplay #1

Danielhp95 commented Mar 5, 2021

Develop sad crossplay #1

Are you sure you want to change the base?

Develop sad crossplay #1

Conversation

Danielhp95 commented Mar 5, 2021