Hanabi obl pytorch #63

Merged
merged 6 commits into from
Mar 5, 2024
Conversation

ravihammond
Copy link
Collaborator

PyTorch OBL now works.

@mttga mttga merged commit aae9f99 into hanabi_obl_aligned Mar 5, 2024
6 checks passed
@mttga mttga deleted the hanabi_obl_pytorch branch March 5, 2024 13:36
@ravihammond ravihammond restored the hanabi_obl_pytorch branch March 15, 2024 16:51
@hnekoeiq
Copy link

Hi @ravihammond and @mttga,

Thanks for the great repo!
Have you been able to reproduce the Hanabi IQL/VDN results? I just tried with the config file `qlearn_hanabi.yaml` (`python baselines/QLearning/iql.py +alg=qlearn_hanabi +env=hanabi`), and the following is the agent's performance after almost 200 million timesteps:
[screenshot: training curve of the agent's performance]

Looking at the original Hanabi paper, 100 million steps should be enough to reach a score of around 20. I'd appreciate it if you could share your thoughts.

@mttga
Copy link
Collaborator

mttga commented Mar 15, 2024

Hi @hnekoeiq, no, we are still working on that. Our IQL/VDN implementations are baselines intended for simple environments, while the original C++ implementations are much more sophisticated and use many tricks. In the meantime, you can use IPPO, which is fast and converges.
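For readers unfamiliar with the distinction @mttga draws, here is a minimal JAX sketch (illustrative only, not JaxMARL's actual implementation) of the core difference between the IQL and VDN targets for two agents: IQL bootstraps each agent's TD target independently, while VDN decomposes a joint Q-value as the sum of per-agent Q-values and computes a single shared target.

```python
import jax.numpy as jnp

# Illustrative sketch, NOT the repo's code: TD targets for two agents.

def iql_targets(q1, q2, r, gamma=0.99):
    # IQL: each agent bootstraps from its own greedy Q, ignoring the other.
    t1 = r + gamma * jnp.max(q1)
    t2 = r + gamma * jnp.max(q2)
    return t1, t2

def vdn_target(q1, q2, r, gamma=0.99):
    # VDN: the joint Q is the sum of per-agent Qs, so there is a single
    # shared TD target built from the summed greedy values.
    return r + gamma * (jnp.max(q1) + jnp.max(q2))
```

With next-step Q-values `q1 = [1., 2.]`, `q2 = [0., 3.]`, reward 1 and gamma 0.5, IQL yields two targets (2.0 and 2.5) while VDN yields one shared target (3.5).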
