`project_configuration.automatic_checkpoint_naming` synchronization between `load_state` and `save_state` #3306

diffunity · 2024-12-19T07:34:46Z

I can't find a way to keep the checkpoint names generated by `save_state` in sync with the checkpoint that `load_state` restores.

What I mean is this: when we call `save_state` with `project_configuration.automatic_checkpoint_naming = True`, a checkpoint folder is created at `output_dir/checkpoint_0`, and the accelerator object keeps track of the checkpoint iteration through the attribute `self.project_configuration.iteration`.

If I reinitialize the accelerator object and call `load_state` on, say, `output_dir/checkpoint_5`, then `self.project_configuration.iteration` starts at 0 for the new accelerator object. A subsequent `save_state` therefore writes to `output_dir/checkpoint_0` instead of continuing from `checkpoint_5`. Is there a way to synchronize this counter during `load_state` so that I don't have to set the exact checkpoint iteration by hand?
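
For reference, the manual workaround I would like to avoid looks roughly like the sketch below. It is only an illustration: `next_iteration` is a hypothetical helper of my own (not part of accelerate), and it assumes the checkpoint folder name ends in `checkpoint_<N>` as described above. The accelerator calls appear only as comments, since I am assuming that setting `project_configuration.iteration` before the next `save_state` is enough to continue the numbering.

```python
import re
from pathlib import Path


def next_iteration(checkpoint_dir: str) -> int:
    """Return the iteration that should follow the given checkpoint folder.

    Hypothetical helper: assumes the folder name is `checkpoint_<N>`.
    """
    match = re.fullmatch(r"checkpoint_(\d+)", Path(checkpoint_dir).name)
    if match is None:
        raise ValueError(f"not an automatically named checkpoint: {checkpoint_dir!r}")
    return int(match.group(1)) + 1


# Assumed usage with a freshly created accelerator (not executed here):
#   accelerator.load_state("output_dir/checkpoint_5")
#   accelerator.project_configuration.iteration = next_iteration("output_dir/checkpoint_5")
#   accelerator.save_state()  # intended to continue numbering after checkpoint_5
```

This works only when the caller already knows which checkpoint folder was loaded, which is exactly the bookkeeping I was hoping `load_state` could do itself.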
