Lewis Tunstall's picture

In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a model 4 minutes ago

hf-imo-colab/Qwen3-4B-Thinking-2507-Proof

updated a model 5 minutes ago

hf-imo-colab/Qwen3-4B-Thinking-2507-Proof

updated a model 42 minutes ago

hf-imo-colab/Qwen3-4B-Thinking-2507-Proof

View all activity

Organizations

upvoted a paper 5 days ago

Evaluating Parameter Efficient Methods for RLVR

Paper • 2512.23165 • Published 8 days ago • 24

upvoted 3 articles 18 days ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

+4

19 days ago

•

99

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

28 days ago

•

82

Article

Shadow AI - Where are the CIOs?

18 days ago

•

31

upvoted a collection 20 days ago

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 5 days ago • 42

upvoted an article 21 days ago

Article

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

22 days ago

•

104

upvoted a paper 29 days ago

Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning

Paper • 2508.09726 • Published Aug 13, 2025 • 15

upvoted 2 articles about 1 month ago

Article

Yay! Organizations can now publish blog Articles

Jan 20, 2025

•

53

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

562

upvoted 3 papers about 1 month ago

Kimi K2: Open Agentic Intelligence

Paper • 2507.20534 • Published Jul 28, 2025 • 9

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published Dec 6, 2024 • 23

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Paper • 2511.07317 • Published Nov 10, 2025 • 15

upvoted an article about 1 month ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

Dec 1, 2025

•

265

upvoted a paper about 1 month ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 138

upvoted an article about 1 month ago

Article

Continuous batching from first principles

+1

Nov 25, 2025

•

296

upvoted an article about 2 months ago

Article

Introducing Cogito v2.1

Nov 19, 2025

•

17

upvoted a collection about 2 months ago

Cogito v2.1

2 items • Updated Nov 19, 2025 • 14

upvoted an article 2 months ago

Article

Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation

Sep 16, 2025

•

17

upvoted 2 papers 2 months ago

An efficient probabilistic hardware architecture for diffusion-like models

Paper • 2510.23972 • Published Oct 28, 2025 • 4

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29, 2025 • 45