Blog, Articles, and discussions

Build an AI Shopping Assistant with Gradio MCP Servers

By July 31, 2025 • 9

Community Articles

view all

Introducing Command A Vision: Multimodal AI built for Business

and 3 others •

about 12 hours ago

• 39

Your Own GPU-Powered Image Generator with HF Jobs

•

about 13 hours ago

• 23

Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever.

•

15 days ago

• 129

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 196

From Zero to MCP: Three Lessons I Learned Building Tools for LLMs

•

1 day ago

• 5

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 322

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

•

Apr 16

• 28

Detecting Beyond Sight: Building AI-Enabled SAR Intelligence with Synthetic Data

•

6 days ago

• 4

🧠 MyMories `.mmr` – Compressed Memory Recall for LLM Continuity

•

6 days ago

• 4

🕺 Tensor Pose Animation Pipeline

•

6 days ago

• 4

🎭 Music Control Net for Video

•

6 days ago

• 4

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 154

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 352

Provence: efficient and robust context pruning for retrieval-augmented generation

and 3 others •

Jan 28

• 16

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 4 others •

Jun 11

• 73

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

and 3 others •

13 days ago

• 47

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

By June 6, 2025 • 52

KV Cache from scratch in nanoVLM

By June 4, 2025 • 88

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By June 3, 2025 • 212

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

By June 3, 2025 guest • 79

CodeAgents + Structure: A Better Way to Execute Actions

By May 28, 2025 • 70

🐯 Liger GRPO meets TRL

By May 25, 2025 guest • 47

Dell Enterprise Hub is all you need to build AI on premises

By May 23, 2025 • 20

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

By May 23, 2025 • 152

Exploring Quantization Backends in Diffusers

By May 21, 2025 • 39

nanoVLM: The simplest repository to train your VLM in pure PyTorch

By May 21, 2025 • 197

Microsoft and Hugging Face expand collaboration

By May 19, 2025 • 23

The Transformers Library: standardizing model definitions

By May 15, 2025 • 116

Improving Hugging Face Model Access for Kaggle Users

By May 14, 2025 • 33

Blazingly fast whisper transcriptions with Inference Endpoints

By May 13, 2025 • 72

Community Articles

Introducing Command A Vision: Multimodal AI built for Business

and 3 others •

about 12 hours ago

• 39

Your Own GPU-Powered Image Generator with HF Jobs

•

about 13 hours ago

• 23

Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever.

•

15 days ago

• 129

LLMGameHub: How We Won the Gradio Agents & MCP Hackathon 2025

and 1 other •

3 days ago

• 14

AI Companionship: Why We Need to Evaluate How AI Systems Handle Emotional Bonds

and 2 others •

11 days ago

• 16

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 106

Code a simple RAG from scratch

•

Oct 29, 2024

• 136

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 196

From Zero to MCP: Three Lessons I Learned Building Tools for LLMs

•

1 day ago

• 5

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 322

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

•

Apr 16

• 28

Detecting Beyond Sight: Building AI-Enabled SAR Intelligence with Synthetic Data

•

6 days ago

• 4

🧠 MyMories `.mmr` – Compressed Memory Recall for LLM Continuity

•

6 days ago

• 4

🕺 Tensor Pose Animation Pipeline

•

6 days ago

• 4

🎭 Music Control Net for Video

•

6 days ago

• 4

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 154

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 352

Provence: efficient and robust context pruning for retrieval-augmented generation

and 3 others •

Jan 28

• 16

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 4 others •

Jun 11

• 73

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

and 3 others •

13 days ago

• 47

View all