directory containing checkpoints | |
accelerator.load_state("ckpt") | |
However, when training ends, you want to save the full state dict because sharded state dict is only compatible with FSDP. |
directory containing checkpoints | |
accelerator.load_state("ckpt") | |
However, when training ends, you want to save the full state dict because sharded state dict is only compatible with FSDP. |