File size: 200 Bytes
5fa1a76
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
GPU
When you train bigger models you have essentially three options:

bigger GPUs
more GPUs
more CPU and NVMe (offloaded to by DeepSpeed-Infinity)

Let's start at the case where you have a single GPU.