28 63 75

Elie Bakouch

eliebak

AI & ML interests

Training LLM's @ 🤗

Recent Activity

updated a Space about 9 hours ago

open-r1/open-r1-eval-leaderboard

updated a Space about 9 hours ago

open-r1/open-r1-eval-leaderboard

updated a Space about 9 hours ago

open-r1/open-r1-eval-leaderboard

View all activity

Organizations

eliebak's activity

upvoted an article 3 days ago

Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

and 1 other •

3 days ago

• 19

upvoted an article 4 days ago

Article

Open R1: Update #2

and 6 others •

4 days ago

• 154

upvoted a paper 5 days ago

On Teacher Hacking in Language Model Distillation

Paper • 2502.02671 • Published 10 days ago • 15

upvoted a paper 8 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 10 days ago • 161

upvoted a paper 10 days ago

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Paper • 2501.18965 • Published 14 days ago • 6

upvoted 2 articles 10 days ago

Article

Open-source DeepResearch – Freeing our search agents

11 days ago

• 964

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

11 days ago

• 45

upvoted an article 12 days ago

Article

Open-R1: Update #1

and 7 others •

13 days ago

• 276

upvoted an article 14 days ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

•

14 days ago

• 34

upvoted 2 articles 16 days ago

Article

Mastering Long Contexts in LLMs with KVPress

and 1 other •

22 days ago

• 62

Article

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

•

16 days ago

• 16

upvoted a paper 16 days ago

Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts

Paper • 2501.14334 • Published 21 days ago • 17

upvoted a paper 17 days ago

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published Jan 10 • 43

upvoted an article 17 days ago

Article

Welcome to Inference Providers on the Hub 🔥

18 days ago

• 341

upvoted an article 18 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

18 days ago

• 734

upvoted an article 25 days ago

Article

Yay! Organizations can now publish blog Articles

and 3 others •

25 days ago

• 33

upvoted an article 29 days ago

Article

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

•

about 1 month ago

• 40

upvoted a collection about 1 month ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 8 days ago • 233

upvoted a paper about 1 month ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 255

upvoted a collection about 1 month ago

DolphinLabeled Datasets

Collection

Eric Hartford has added labels to help you filter datasets, for your pleasure. • 5 items • Updated Jan 6 • 12