3 - Advantage Actor Critic (A2C) [CartPole].ipynb - Returns do not need to be detached #3

nimrare · 2021-02-04T14:22:38Z

Hi Ben

Thanks for the interesting notebooks. Upon studying the "3 - Advantage Actor Critic (A2C) [CartPole].ipynb" notebook, I came to the conclusion that detaching the returns in the update_policy() function is not necessary. The returns are only calculated on the rewards which are environment outputs and therefore not part of the computational graph. So even leaving out the .detach() call should not affect the model. Would you agree?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

3 - Advantage Actor Critic (A2C) [CartPole].ipynb - Returns do not need to be detached #3

3 - Advantage Actor Critic (A2C) [CartPole].ipynb - Returns do not need to be detached #3

nimrare commented Feb 4, 2021

3 - Advantage Actor Critic (A2C) [CartPole].ipynb - Returns do not need to be detached #3

3 - Advantage Actor Critic (A2C) [CartPole].ipynb - Returns do not need to be detached #3

Comments

nimrare commented Feb 4, 2021