
Corrected Hanabi, new Dockerfile, python 3.10 and other fixes #71

Merged · 25 commits · Mar 22, 2024

Conversation

mttga
Collaborator

@mttga mttga commented Mar 22, 2024

This merge contains the following updates:

  • A new, properly tested Hanabi environment that achieves SOTA performance (plus nice features like rendering, playing manual games, and support for pretrained OBL models).
  • A new Dockerfile based on the NVIDIA official JAX image.
  • A refinement of the requirements files, avoiding multiple installation types (dev, qlearning, etc.). The requirements are now pinned strictly, both to mirror the NVIDIA image as closely as possible and to prevent people from using mismatched JAX/jaxlib/Flax/Brax versions (important since we're re-collecting results in the next sprint). This might be too strict in some cases, but too many people have reported installation issues, so I think it's worth it.
  • Support for Python 3.10 (and removal of support for Python 3.8-3.9). This is also done with the goal of mirroring the NVIDIA image, which uses Python 3.10.
  • A fix for issue #66 (Unable to replicate performance with Q-Learning on SMAX), which also drastically improves the reported TransfQMix results on the "sz"-based maps.
  • A fix for a deprecated use of jnp.concat (instead of jnp.concatenate) in SMAX, which caused problems when using the NVIDIA image.
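The last fix above boils down to a one-line API swap. A minimal sketch of the change (the array names and shapes here are illustrative, not taken from the SMAX code):

```python
import jax.numpy as jnp

# Two hypothetical per-agent observation arrays to be joined along the feature axis.
obs_a = jnp.ones((4, 8))
obs_b = jnp.zeros((4, 3))

# Before: jnp.concat(...) — unreliable across the pinned JAX versions.
# After: jnp.concatenate, the long-standing stable API.
merged = jnp.concatenate([obs_a, obs_b], axis=-1)
print(merged.shape)  # (4, 11)
```

Since jnp.concatenate has been stable across JAX releases, pinning to it avoids version-dependent breakage inside the NVIDIA image.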

[results plot: ippo_ff_hanabi]

@mttga
Collaborator Author

mttga commented Mar 22, 2024

It says 10k additions, which looks crazy, but that's because we're adding a ground-truth file for testing Hanabi that contains 10k game scores.

@benellis3
Contributor

benellis3 commented Mar 22, 2024

Have you tested that the PPO code runs with the new requirements for other environments? Otherwise this could break a lot of stuff.

@mttga
Collaborator Author

mttga commented Mar 22, 2024

> Have you tested that the PPO code runs with the new requirements for other environments? Otherwise this could break a lot of stuff.

I've checked that all the scripts run, and I collected results for both IPPO and MAPPO with RNNs on MPE and SMAX here: https://wandb.ai/mttga/jaxmarl_pull_request_71?nw=nwusermttga

Let me know if you want to see other results.

@mttga
Collaborator Author

mttga commented Mar 22, 2024

Overcooked:
[results plot: overcooked_cramped_room_new]

@benellis3
Contributor

benellis3 commented Mar 22, 2024

LGTM. Thanks very much for all this hard work, this is awesome 😄. Feel free to merge when you are ready.

@mttga mttga merged commit 8f17f22 into main Mar 22, 2024
6 checks passed

4 participants