Blog, Articles, and discussions

Jupyter Agents: training LLMs to reason with notebooks

By September 10, 2025 • 34

Community Articles

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

and 1 other •

10 days ago

• 55

How to Choose the Best Open Source LLM for Your Project in 2025

•

12 days ago

• 69

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

and 1 other •

10 days ago

• 17

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

and 4 others •

5 days ago

• 10

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

and 5 others •

11 days ago

• 95

"Anemll-style" Root-Mean-Square (RMS) Normalization on the Apple Neural Engine: A Simple Hack

•

4 days ago

• 9

🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎

and 1 other •

4 days ago

• 7

Unleashing the Full Potential of ERNIE4.5 using FastDeploy

and 3 others •

2 days ago

• 7

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 219

Use AI on Your PC: Optimize and Deploy a Multimodal Agentic Pipeline on AI PC Powered by Intel

and 2 others •

4 days ago

• 5

Finegrain Product Placement LoRA (experiment)

•

3 days ago

• 5

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 360

Decoding Strategies in Large Language Models

•

Oct 29, 2024

• 89

Understanding Vector Quantization in VQ-VAE

•

Aug 28, 2024

• 42

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 5 others •

Jun 11

• 91

Diffusion Language Models: The New Paradigm

•

Jun 10

• 16

mmBERT: ModernBERT goes Multilingual

By September 9, 2025 • 92

MCP for Research: How to Connect AI to Research Tools

By August 18, 2025 • 54

TextQuests: How Good are LLMs at Text-Based Video Games?

By August 12, 2025 guest • 34

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

By July 29, 2025 • 179

Back to The Future: Evaluating AI Agents on Predicting Future Events

By July 17, 2025 guest • 41

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

By July 16, 2025 • 67

SmolLM3: smol, multilingual, long-context reasoner

By July 8, 2025 • 675

Efficient MultiModal Data Pipeline

By July 8, 2025 • 55

Gemma 3n fully available in the open-source ecosystem!

By June 26, 2025 • 117

nanoVLM: The simplest repository to train your VLM in pure PyTorch

By May 21, 2025 • 216

Vision Language Models (Better, Faster, Stronger)

By May 12, 2025 • 529

Introducing HELMET

By April 16, 2025 • 37

Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More

By April 8, 2025 guest • 18

Open R1: How to use OlympicCoder locally for coding?

By March 20, 2025 • 63

Community Articles

RexBERT: Encoders for a brave new world of E-Commerce

and 1 other •

about 9 hours ago

• 8

How to Train an Antibody Developability Model

and 1 other •

4 days ago

• 8

Code a simple RAG from scratch

•

Oct 29, 2024

• 198

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 70

