Stars
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
Automate browser-based workflows with LLMs and Computer Vision
An elegant PyTorch deep reinforcement learning library.
A toolkit for reproducible reinforcement learning research.
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.