Younes Belkada's picture

Younes Belkada

ybelkada

·

AI & ML interests

Large Language Models, Quantization, Vision, Multimodality, Diffusion models

Organizations

ybelkada's activity

published an article 6 months ago

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12, 2024

• 108

published an article 10 months ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18, 2024

• 283

published an article 11 months ago

Article

GaLore: Advancing Large Model Training on Consumer-grade Hardware

Mar 20, 2024

• 26

published an article 11 months ago

Article

quanto: a pytorch quantization toolkit

Mar 18, 2024

• 33

published an article 12 months ago

Article

Fine-Tuning Gemma Models in Hugging Face

Feb 23, 2024

• 27

published an article about 1 year ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 335

published an article about 1 year ago

Article

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Dec 11, 2023

• 11

published an article over 1 year ago

Article

Overview of natively supported quantization schemes in 🤗 Transformers

Sep 12, 2023

• 11

published an article over 1 year ago

Article

Making LLMs lighter with AutoGPTQ and transformers

Aug 23, 2023

• 39

published an article over 1 year ago

Article

Fine-tune Llama 2 with DPO

Aug 8, 2023

• 40

published an article over 1 year ago

Article

The Falcon has landed in the Hugging Face ecosystem

Jun 5, 2023

• 12

published an article over 1 year ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 112

published an article over 1 year ago

Article

Introducing RWKV — An RNN with the advantages of a transformer

May 15, 2023

• 15

published an article almost 2 years ago

Article

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Apr 5, 2023

• 26

published an article almost 2 years ago

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023

• 37

published an article over 2 years ago

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

• 71