Leandro von Werra
lvwerra
305 followers · 56 following
https://github.com/lvwerra
AI & ML interests
NLP and RL
Recent Activity
upvoted an article 1 day ago: Open R1: Update #2
authored a paper 5 days ago: SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
upvoted a paper 5 days ago: SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
Articles (29)
DABStep: Data Agent Benchmark for Multi-step Reasoning • 41
Open-R1: Update #1 • 268
Papers (14)
arxiv:2502.02737
arxiv:2501.08365
arxiv:2410.24198
arxiv:2406.17557
Spaces (21)
Executor 📚 • Sleeping • 1
3d Bench Viz 📈 • Sleeping
3d 🔥 • Running • 7 • Visualize 3D parallelism configuration
Train LLMs ⚡ • Sleeping • 10 • Calculate training cost and model efficiency
Text Source Viz 👁 • Sleeping
Harm Space ⚡ • Runtime error • 20
Models (33)
lvwerra/the-tokenizer-v1 • Updated Feb 12, 2024 • 1
lvwerra/sc2 • Updated Feb 11, 2024 • 2
lvwerra/starcoder-98k-no-regex-no-digits • Updated Sep 29, 2023
lvwerra/starcoder-393k • Updated Sep 28, 2023
lvwerra/starcoder-196k • Updated Sep 28, 2023
lvwerra/starcoder-98k • Updated Sep 27, 2023
lvwerra/starcoder-24k • Updated Sep 27, 2023
lvwerra/starcoder-12k • Updated Sep 27, 2023
lvwerra/starcoder-6k • Updated Sep 27, 2023
lvwerra/starcoderbase-gsm8k • Text Generation • Updated Aug 30, 2023 • 23
Datasets (22)
lvwerra/dabstep • Viewer • Updated 7 days ago • 3 • 2.8k
lvwerra/needle-llama3-16x524k • Viewer • Updated Apr 26, 2024 • 1.41k • 245 • 1
lvwerra/needle-llama3-16x65k • Viewer • Updated Apr 26, 2024 • 1.41k • 97 • 1
lvwerra/needle-llama3-16x8k • Viewer • Updated Apr 26, 2024 • 1.41k • 46 • 1
lvwerra/needle-llama3-16x512 • Viewer • Updated Apr 26, 2024 • 1.41k • 44 • 1
lvwerra/admin • Viewer • Updated Mar 6, 2024 • 1 • 406
lvwerra/stack-exchange-paired • Viewer • Updated Mar 13, 2023 • 31.3M • 2.61k • 143
lvwerra/git-commits-clean • Updated Mar 2, 2023 • 6
lvwerra/changeit • Viewer • Updated Jan 8, 2023 • 31 • 207
lvwerra/code-ml • Viewer • Updated Jan 4, 2023 • 1.5k • 26