Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. By MaziyarPanahi • 12 days ago • 123
OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • 10 days ago • 46
LLMGameHub: How We Won the Gradio Agents & MCP Hackathon 2025 By kikikita and 1 other • about 7 hours ago • 7
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 192
Detecting Beyond Sight: Building AI-Enabled SAR Intelligence with Synthetic Data By DualityAI-RebekahBogdanoff • 3 days ago • 4
We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for less than $200 ✌️✌️✌️ By pollen-robotics and 2 others • 21 days ago • 33
Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models By AI-MO and 17 others • 18 days ago • 46
Understanding Model Reasoning Through Thought Anchors: A Comparative Study of Qwen3 and DeepSeek-R1 By codelion • 6 days ago • 3