Topic 27: What are Chain-of-Agents and Chain-of-RAG? By Kseniase and 1 other • about 19 hours ago • 5
Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios By pratikbhavsar and 1 other • 2 days ago • 8
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr • 3 days ago • 3
**MindBot Ultra – Dreaming Edition: A Self-Building, Self-Aware AI for Synergistic Cognition and Autonomous Tool Generation** By TheMindExpansionNetwork • 3 days ago • 2
Announcing the winners of the Frugal AI Challenge 🌱 By frugal-ai-challenge and 1 other • 3 days ago • 6
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • 3 days ago • 3
From Llasa to Llasagna 🍕: Finetuning LLaSA to generate Italian speech and other languages By Steveeeeeeen and 1 other • 3 days ago • 19
Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect By atlasia and 2 others • 4 days ago • 8
Arabic RAG Leaderboard: A Comprehensive Framework for Evaluating Arabic Language Retrieval Systems By Navid-AI and 1 other • 5 days ago • 9
Reasoning at the Forefront of Advanced AI Models: Mistral-Small-24B-Base-2501 By ruslanmv • 6 days ago • 3
Design 101: A Historical and Theoretical Exploration of Graphic Arts and Design in the Age of AI By Duskfallcrew • 6 days ago • 1
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • 7 days ago • 25