FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design • Paper • 2401.14112 • Published Jan 25, 2024
The Impact of Reasoning Step Length on Large Language Models • Paper • 2401.04925 • Published Jan 10, 2024