Way to restart experiments #45

matthiasreisser · 2017-04-26T08:23:39Z

Use case: an experiment terminates, and I now want to resume training from a saved checkpoint. The experiment directory should be the same as for the initial run, since ideally the model checkpoints (and, for example, TensorFlow summaries) are already in the previous experiment's folder.

Terminal output would either be appended to the previous output.txt or written to a new output2.txt, etc.

petered · 2017-07-26T07:57:05Z

Thought about this...

One way would be to have experiments extend a PersistentExperiment base class with an __init__ and an abstract "run_step" method. Its "run" method would repeatedly call run_step, while either saving checkpoints periodically or catching keyboard interrupts and saving from there.
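A rough sketch of what such a base class could look like, to make the idea concrete. Only PersistentExperiment, run_step and run come from the description above; checkpoint_path, checkpoint_every, n_steps, save, load_or_create and the CountingExperiment subclass are hypothetical names made up for illustration. This version pickles the instance's attribute dict rather than the object itself, which is one possible choice, not necessarily the intended one.

```python
import os
import pickle
from abc import ABC, abstractmethod


class PersistentExperiment(ABC):
    """Hypothetical base class: checkpoints its own state so a run can resume."""

    def __init__(self, checkpoint_path, checkpoint_every=100):
        self.checkpoint_path = checkpoint_path    # assumed: file inside the experiment directory
        self.checkpoint_every = checkpoint_every  # assumed: save every N completed steps
        self.step = 0

    @abstractmethod
    def run_step(self):
        """Do one unit of work, mutating attributes on self."""

    def save(self):
        # Write to a temporary file first, then swap it in, so an interrupted
        # save never leaves a truncated checkpoint behind.
        tmp_path = self.checkpoint_path + ".tmp"
        with open(tmp_path, "wb") as f:
            pickle.dump(self.__dict__, f)
        os.replace(tmp_path, self.checkpoint_path)

    @classmethod
    def load_or_create(cls, checkpoint_path, **kwargs):
        # Build a fresh experiment, then overwrite its state if a checkpoint exists.
        exp = cls(checkpoint_path, **kwargs)
        if os.path.exists(checkpoint_path):
            with open(checkpoint_path, "rb") as f:
                exp.__dict__.update(pickle.load(f))
            exp.checkpoint_path = checkpoint_path
        return exp

    def run(self, n_steps):
        try:
            while self.step < n_steps:
                self.run_step()
                self.step += 1
                if self.step % self.checkpoint_every == 0:
                    self.save()
        except KeyboardInterrupt:
            # Save what we have, then let the interrupt propagate. If the
            # interrupt lands mid-step, the saved state may be partially updated.
            self.save()
            raise
        self.save()


class CountingExperiment(PersistentExperiment):
    """Toy subclass: accumulates the step indices it has seen."""

    def __init__(self, checkpoint_path, checkpoint_every=100):
        super().__init__(checkpoint_path, checkpoint_every)
        self.total = 0

    def run_step(self):
        self.total += self.step
```

Resuming would then be a matter of calling `CountingExperiment.load_or_create(path)` with the same path as the first run and calling `run` again with a larger `n_steps`; it picks up from the saved `step`.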

A small obstacle is that everything in your experiment needs to be picklable (so no lambda functions, etc.).
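To illustrate the pickling constraint with the standard pickle module: a lambda stored on the experiment cannot be pickled, while an equivalent functools.partial over an importable function can. The is_picklable helper and the scale name are hypothetical, used only for this demonstration.

```python
import functools
import operator
import pickle


def is_picklable(obj):
    """Return True if the standard pickle module can serialize obj."""
    try:
        pickle.dumps(obj)
        return True
    except (pickle.PicklingError, AttributeError, TypeError):
        return False


# A lambda has no importable name, so pickle cannot store a reference to it.
lambda_ok = is_picklable(lambda x: 2 * x)

# The same behaviour expressed as a partial over operator.mul is picklable,
# because operator.mul can be looked up by name when unpickling.
scale = functools.partial(operator.mul, 2)
partial_ok = is_picklable(scale)

# Round trip: the restored callable behaves like the original.
restored = pickle.loads(pickle.dumps(scale))
```

So one workaround is to replace lambdas in the experiment's state with named module-level functions or partials of them.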
