File size: 187 Bytes
5fa1a76
 
 
 
1
2
3
4
directory containing checkpoints
accelerator.load_state("ckpt")

However, when training ends, you want to save the full state dict because sharded state dict is only compatible with FSDP.