Optimize and deploy models with Optimum-Intel and OpenVINO GenAI (Sep 20, 2024)
Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding (Jan 30, 2024)
AMD + 🤗: Large Language Models Out-of-the-Box Acceleration with AMD GPU (Dec 5, 2023)
Introducing Optimum: The Optimization Toolkit for Transformers at Scale (Sep 14, 2021)