Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
The other technique fuses multiple operations into one kernel to reduce the overhead of running each operation separately.