Blog, Articles, and discussions

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

By September 29, 2025 • 13

Community Articles

view all

CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions

•

2 days ago

• 12

Code a simple RAG from scratch

•

Oct 29, 2024

• 211

When Does Reasoning Matter? Unpacking the Contribution of Reasoning to LLM Performance

and 1 other •

3 days ago

• 10

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

•

3 days ago

• 9

Preserving Agency: Why AI Safety Needs Community, Not Corporate Control

•

4 days ago

• 9

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 685

RexBERT: Encoders for a brave new world of E-Commerce

and 1 other •

13 days ago

• 46

Gaia2 Leaderboard Update: New Models and New Observations

and 3 others •

about 21 hours ago

• 6

How to Train an Antibody Developability Model

and 1 other •

16 days ago

• 14

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

and 6 others •

10 days ago

• 25

Nemotron-Personas-Japan: ソブリン AI のための合成データセット

and 6 others •

7 days ago

• 7

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 176

arXiv实用技巧，如何让你的paper关注度变高？

•

Jul 8, 2024

• 14

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

•

Feb 11

• 72

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 226

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 76

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 34

VibeGame: Exploring Vibe Coding Games

By September 29, 2025 • 14

Visible Watermarking with Gradio

By September 15, 2025 • 16

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

By September 11, 2025 • 150

Welcome EmbeddingGemma, Google's new efficient embedding model

By September 4, 2025 • 231

Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation

By September 2, 2025 • 65

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

By August 18, 2025 • 75

MCP for Research: How to Connect AI to Research Tools

By August 18, 2025 • 57

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

By August 8, 2025 • 69

Fast LoRA inference for Flux with Diffusers and PEFT

By July 23, 2025 • 48

Arc Virtual Cell Challenge: A Primer

By July 18, 2025 • 59

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

By July 1, 2025 • 123

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

By June 19, 2025 • 90

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

By June 12, 2025 • 144

Exploring Quantization Backends in Diffusers

By May 21, 2025 • 43

Community Articles

There is no such thing as a tokenizer-free lunch

•

9 days ago

• 69

Model Quality: Hugging Face Is All You Need

•

7 days ago

• 20

ModernVBERT: Towards Smaller Visual Document Retrievers

and 4 others •

about 7 hours ago

• 16

CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions

•

2 days ago

• 12

Code a simple RAG from scratch

•

Oct 29, 2024

• 211

When Does Reasoning Matter? Unpacking the Contribution of Reasoning to LLM Performance

and 1 other •

3 days ago

• 10

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

•

3 days ago

• 9

Preserving Agency: Why AI Safety Needs Community, Not Corporate Control

•

4 days ago

• 9

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 685

RexBERT: Encoders for a brave new world of E-Commerce

and 1 other •

13 days ago

• 46

Gaia2 Leaderboard Update: New Models and New Observations

and 3 others •

about 21 hours ago

• 6

How to Train an Antibody Developability Model

and 1 other •

16 days ago

• 14

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

and 6 others •

10 days ago

• 25

Nemotron-Personas-Japan: ソブリン AI のための合成データセット

and 6 others •

7 days ago

• 7

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 176

arXiv实用技巧，如何让你的paper关注度变高？

•

Jul 8, 2024

• 14

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

•

Feb 11

• 72

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 226

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 76

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 34

View all