Training on TPUs

Note: Most of the strategies introduced in the single-GPU section (such as mixed precision training or gradient accumulation) and the multi-GPU section are generic and apply to training models in general, so make sure to have a look at those sections before diving into this one.
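As a reminder of why a strategy like gradient accumulation is hardware-agnostic, here is a minimal sketch in plain Python. It does not use any library API, it is not TPU-specific, and it is not taken from the sections mentioned above. The toy model (`y_hat = w * x` with a mean squared error loss) and the helper names `mse_grad` and `accumulated_grad` are assumptions made only for illustration. It shows that averaging gradients over equal-size micro-batches reproduces the gradient of the full batch.

```python
# Gradient accumulation sketch in plain Python (no framework, no accelerator).
# Hypothetical toy model: y_hat = w * x, loss = mean squared error.

def mse_grad(w, xs, ys):
    """Gradient of mean((w * x - y) ** 2) with respect to w over one batch."""
    n = len(xs)
    return sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys)) / n


def accumulated_grad(w, xs, ys, micro_batch_size):
    """Accumulate micro-batch gradients, each scaled by 1 / num_micro_batches."""
    if len(xs) % micro_batch_size != 0:
        raise ValueError("this sketch assumes equal-size micro-batches")
    num_micro = len(xs) // micro_batch_size
    total = 0.0
    for start in range(0, len(xs), micro_batch_size):
        end = start + micro_batch_size
        # Scale each micro-batch gradient so the sum equals the full-batch mean.
        total += mse_grad(w, xs[start:end], ys[start:end]) / num_micro
    return total


xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]
w = 0.5

full = mse_grad(w, xs, ys)                               # one batch of 4
accum = accumulated_grad(w, xs, ys, micro_batch_size=2)  # two micro-batches of 2
print(abs(full - accum) < 1e-9)
```

Because the accumulated value matches the full-batch gradient, the optimizer step is the same either way; only the peak memory per forward/backward pass changes. That is why the technique carries over from one kind of device to another.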

This document will be completed soon with information on how to train on TPUs.