An Implementation of the REINFORCE Algorithm for Solving OpenAI Gymnasium Environments
The REINFORCE algorithm was introduced in the paper:
Williams, R. J. (1992). "Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning."
Journal: Mach Learn 8, 229–256.
DOI: 10.1007/BF00992696