Finally generated a checkpoint. How do I use it now to see if the training succeeded? #69

jaganrvce1 · 2025-02-14T02:52:28Z

Is there sample code someone can point me to that loads the checkpoint so I can see if the training worked?
There seem to be two checkpoints, one for the critic and one for the actor. How do I combine them in a single program?

I would appreciate it if someone could point me to a piece of code that shows how the actor and critic are used together. Thanks!

prvnsmpth · 2025-02-18T05:53:39Z

I think you can just load the actor model using Hugging Face Transformers and run inference. The critic model only exists to evaluate the actor model's outputs during training, so you don't need it at inference time.
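Something like the following might work as a starting point. It is only a sketch, and it assumes the actor checkpoint was saved in the standard Hugging Face format (a directory with `config.json`, weights, and tokenizer files); if your checkpoint is in a different layout you would need to convert it first. The function names `generate_answer` and `exact_match_rate` are made up for illustration, and exact-match scoring is just one simple way to judge whether training helped.

```python
def generate_answer(actor_path: str, prompt: str, max_new_tokens: int = 256) -> str:
    """Load the actor checkpoint and generate a completion for one prompt.

    Assumes `actor_path` is a directory in Hugging Face format.
    """
    # Imported here so the scoring helper below works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(actor_path)
    model = AutoModelForCausalLM.from_pretrained(actor_path)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens so only the newly generated text is returned.
    prompt_length = inputs["input_ids"].shape[1]
    return tokenizer.decode(output_ids[0][prompt_length:], skip_special_tokens=True)


def exact_match_rate(predictions: list[str], references: list[str]) -> float:
    """Fraction of predictions equal to their reference after stripping whitespace."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0
    matches = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return matches / len(references)
```

To check whether training helped, you could call `generate_answer` on a handful of held-out prompts with both the base model and the trained actor (the checkpoint path is whatever directory your run wrote the actor to), then compare the two `exact_match_rate` scores. The critic checkpoint is not loaded anywhere here.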
