Reasoning is all you need 🌠 Collection The garage for 14B and 7B reasoning models, listed based on benchmarks. • 24 items • Updated about 6 hours ago • 2
view article Article Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset By sdiazlor • 1 day ago • 19
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 7 days ago • 153
view article Article Agentic RAG Stack (2/5) - Augment retrieval results by reranking using Sentence Transformers By davidberenstein1957 • 6 days ago • 6
view article Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB By davidberenstein1957 • 15 days ago • 15
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 11 items • Updated about 12 hours ago • 48
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 9 days ago • 50
CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation Paper • 2501.16609 • Published 15 days ago • 6
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer Paper • 2501.18427 • Published 12 days ago • 16
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published 12 days ago • 17
Large Language Models Think Too Fast To Explore Effectively Paper • 2501.18009 • Published 13 days ago • 22