SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 7 days ago • 154
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others • 22 days ago • 33
view article Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • 22 days ago • 35
view post Post 1244 You can now use the "Synthetic Data Generator" at a much larger scale with your preferred inference engine: Ollama, vLLM, TGI, and serverless inference! 🔥Install, configure, launch!Space: argilla/synthetic-data-generatorExamples: https://github.com/argilla-io/synthetic-data-generator/tree/main/examples See translation 👀 4 4 ❤️ 1 1 🔥 1 1 + Reply