You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There is a bug in inner_step_get_grad_otheragent2 where the return value uses trainstate_th2.params and trainstate_val2.params. I think that should be trainstate_th2_.params and trainstate_val2_.params? As it is, it takes params from the trainstates in the global scope which haven't been trained. However, with the way the result of inner_step_get_grad_otheragent2 is used in the caller, I think this bug should have no effect unless args.inner_steps > 2.
There is a bug in
inner_step_get_grad_otheragent2
where the return value usestrainstate_th2.params
andtrainstate_val2.params
. I think that should betrainstate_th2_.params
andtrainstate_val2_.params
? As it is, it takes params from the trainstates in the global scope which haven't been trained. However, with the way the result ofinner_step_get_grad_otheragent2
is used in the caller, I think this bug should have no effect unlessargs.inner_steps > 2
.POLA/jax_files/POLA_dice_jax.py
Line 541 in 6b07e89
The text was updated successfully, but these errors were encountered: