Article: Universal Assisted Generation: Faster Decoding with Any Assistant Model (Oct 29, 2024)
Article: Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding (Jan 30, 2024)