Auxiliary Load Balancing Loss is Dropped

#32
by codys12 - opened

The auxiliary load balancing loss is dropped in HunYuanMoE.forward(): it is never passed on to the final loss, so it has no effect on training.
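
For illustration, here is a rough sketch of what carrying the loss through to the final loss could look like. It is not the actual HunYuan code: the function names, the `aux_coef` parameter, and the Switch-Transformer-style formulation (top-1 routing, loss = num_experts * sum over experts of token fraction times mean router probability) are all assumptions made for the example.

```python
from typing import List

# Hypothetical sketch, NOT the HunYuanMoE implementation.
# Assumes a Switch-Transformer-style auxiliary loss with top-1 routing.


def load_balancing_loss(router_probs: List[List[float]], num_experts: int) -> float:
    """Return num_experts * sum_e(f_e * p_e) for one MoE layer.

    f_e is the fraction of tokens whose top-1 expert is e,
    p_e is the mean router probability assigned to expert e.
    """
    num_tokens = len(router_probs)
    token_fraction = [0.0] * num_experts
    mean_prob = [0.0] * num_experts
    for probs in router_probs:
        # Top-1 expert for this token (first index wins on ties).
        top1 = max(range(num_experts), key=lambda e: probs[e])
        token_fraction[top1] += 1.0 / num_tokens
        for e in range(num_experts):
            mean_prob[e] += probs[e] / num_tokens
    return num_experts * sum(f * p for f, p in zip(token_fraction, mean_prob))


def total_loss(lm_loss: float, aux_losses: List[float], aux_coef: float = 0.01) -> float:
    """Add the per-layer auxiliary losses to the main loss.

    aux_coef is an assumed weighting; the value used in practice is a
    hyperparameter and is not taken from the HunYuan code.
    """
    return lm_loss + aux_coef * sum(aux_losses)


# Usage: each MoE layer returns its auxiliary loss instead of discarding it,
# and the training step sums them into the final loss.
balanced = load_balancing_loss([[0.9, 0.1], [0.1, 0.9]], num_experts=2)
collapsed = load_balancing_loss([[0.9, 0.1], [0.9, 0.1]], num_experts=2)
final = total_loss(lm_loss=2.0, aux_losses=[balanced, collapsed])
```

In this sketch, the collapsed routing (every token sent to one expert) gets a higher auxiliary loss than the balanced one. That penalty only influences training if the value actually reaches the final loss.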
