Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
raw
history blame contribute delete
139 Bytes
When the communication is done in fp16 or bf16, it is more likely to be lossy because adding multiple numbers in low precision isn't exact.