Blog, Articles, and discussions

AI Agents Are Here. What Now?

By January 13, 2025 • 65

Community Articles

view all

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

3 days ago

• 3

MindBot Ultra – Dreaming Edition: A Self-Building, Self-Aware AI for Synergistic Cognition and Autonomous Tool Generation

•

3 days ago

• 2

Announcing the winners of the Frugal AI Challenge 🌱

and 1 other •

3 days ago

• 6

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

•

3 days ago

• 3

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

and 1 other •

3 days ago

• 19

Announcing AI Energy Score Ratings

•

3 days ago

• 22

🌁#87: Why DeepResearch Should Be Your New Hire

•

4 days ago

• 4

Prompt Engineering in Multi-Agent Systems with KaibanJS

•

4 days ago

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

•

4 days ago

• 27

Open R1: Update #2

and 6 others •

4 days ago

• 154

Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect

and 2 others •

4 days ago

• 8

ROOST: Safety Tooling needs Open Tech🐓🤗

•

4 days ago

• 5

Arabic RAG Leaderboard: A Comprehensive Framework for Evaluating Arabic Language Retrieval Systems

and 1 other •

5 days ago

• 9

Struggling to understand enterprise-scale codebase?

•

6 days ago

• 2

Reasoning at the Forefront of Advanced AI Models : Mistral-Small-24B-Base-2501

•

6 days ago

• 3

Design 101: A Historical and Theoretical Exploration of Graphic Arts and Design in the Age of AI

•

6 days ago

• 1

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

7 days ago

• 25

Evaluating Language Model Bias with 🤗 Evaluate

By October 24, 2022 • 3

Ethics and Society Newsletter #1

By September 22, 2022

AI Policy @🤗: Comments on U.S. National AI Research Resource Interim Report

By August 1, 2022

Community Articles

view all

Topic 27: What are Chain-of-Agents and Chain-of-RAG?

and 1 other •

about 19 hours ago

• 5

Adventures in AI

•

about 22 hours ago

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

and 1 other •

2 days ago

• 8

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

3 days ago

• 3

MindBot Ultra – Dreaming Edition: A Self-Building, Self-Aware AI for Synergistic Cognition and Autonomous Tool Generation

•

3 days ago

• 2

Announcing the winners of the Frugal AI Challenge 🌱

and 1 other •

3 days ago

• 6

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

•

3 days ago

• 3

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

and 1 other •

3 days ago

• 19

Announcing AI Energy Score Ratings

•

3 days ago

• 22

🌁#87: Why DeepResearch Should Be Your New Hire

•

4 days ago

• 4

Prompt Engineering in Multi-Agent Systems with KaibanJS

•

4 days ago

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

•

4 days ago

• 27

Open R1: Update #2

and 6 others •

4 days ago

• 154

Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect

and 2 others •

4 days ago

• 8

ROOST: Safety Tooling needs Open Tech🐓🤗

•

4 days ago

• 5

Arabic RAG Leaderboard: A Comprehensive Framework for Evaluating Arabic Language Retrieval Systems

and 1 other •

5 days ago

• 9

Struggling to understand enterprise-scale codebase?

•

6 days ago

• 2

Reasoning at the Forefront of Advanced AI Models : Mistral-Small-24B-Base-2501

•

6 days ago

• 3

Design 101: A Historical and Theoretical Exploration of Graphic Arts and Design in the Age of AI

•

6 days ago

• 1

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

7 days ago

• 25

Blog, Articles, and discussions

AI Agents Are Here. What Now?

Topic 27: What are Chain-of-Agents and Chain-of-RAG?

Adventures in AI

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

**MindBot Ultra – Dreaming Edition: A Self-Building, Self-Aware AI for Synergistic Cognition and Autonomous Tool Generation**

Announcing the winners of the Frugal AI Challenge 🌱

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

Announcing AI Energy Score Ratings

🌁#87: Why DeepResearch Should Be Your New Hire

Prompt Engineering in Multi-Agent Systems with KaibanJS

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

Open R1: Update #2

Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect

ROOST: Safety Tooling needs Open Tech🐓🤗

Arabic RAG Leaderboard: A Comprehensive Framework for Evaluating Arabic Language Retrieval Systems

Struggling to understand enterprise-scale codebase?

Reasoning at the Forefront of Advanced AI Models : Mistral-Small-24B-Base-2501

Design 101: A Historical and Theoretical Exploration of Graphic Arts and Design in the Age of AI

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Evaluating Language Model Bias with 🤗 Evaluate

Ethics and Society Newsletter #1

AI Policy @🤗: Comments on U.S. National AI Research Resource Interim Report

Topic 27: What are Chain-of-Agents and Chain-of-RAG?

Adventures in AI

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

**MindBot Ultra – Dreaming Edition: A Self-Building, Self-Aware AI for Synergistic Cognition and Autonomous Tool Generation**

Announcing the winners of the Frugal AI Challenge 🌱

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

Announcing AI Energy Score Ratings

🌁#87: Why DeepResearch Should Be Your New Hire

Prompt Engineering in Multi-Agent Systems with KaibanJS

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

Open R1: Update #2

Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect

ROOST: Safety Tooling needs Open Tech🐓🤗

Arabic RAG Leaderboard: A Comprehensive Framework for Evaluating Arabic Language Retrieval Systems

Struggling to understand enterprise-scale codebase?

Reasoning at the Forefront of Advanced AI Models : Mistral-Small-24B-Base-2501

Design 101: A Historical and Theoretical Exploration of Graphic Arts and Design in the Age of AI

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

MindBot Ultra – Dreaming Edition: A Self-Building, Self-Aware AI for Synergistic Cognition and Autonomous Tool Generation

MindBot Ultra – Dreaming Edition: A Self-Building, Self-Aware AI for Synergistic Cognition and Autonomous Tool Generation