5fa1a76
1
2
3
4
5
6
7
8
GPU When you train bigger models you have essentially three options: bigger GPUs more GPUs more CPU and NVMe (offloaded to by DeepSpeed-Infinity) Let's start at the case where you have a single GPU.