Welcome FalconMamba: The first strong attention-free 7B model Aug 12, 2024 • 108
GaLore: Advancing Large Model Training on Consumer-grade Hardware Mar 20, 2024 • 26
Overview of natively supported quantization schemes in 🤗 Transformers Sep 12, 2023 • 11
Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA May 24, 2023 • 112
Introducing RWKV — An RNN with the advantages of a transformer May 15, 2023 • 15
A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 71