CircleGuardBench: New Standard for Evaluating AI Moderation Models
47

I trained a Language Model to schedule events with GRPO!
61

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly! – Talking About It?
239

Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios
18

Page-to-Video: Generate videos from webpages 🪄🎬
16

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs
14

AI Personas: The Impact of Design Choices
12

Reduce, Reuse, Recycle: Why Open Source is a Win for Sustainability
11

Uncensor any LLM with abliteration
548

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
130

Creating your custom Ghibli Text-to-Image model
15

🦸🏻#17: What is A2A and why is it – still! – underappreciated?
8

DeepWiki: Best AI Documentation Generator for Any Github Repo
17

ColPali: Efficient Document Retrieval with Vision Language Models 👀
246

What is test-time compute and how to scale it?
85

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
29

Introduction to State Space Models (SSM)
128

Code a simple RAG from scratch
67

KV Caching Explained: Optimizing Transformer Inference Efficiency
63

Reasoning Datasets Competition
33