view article Article Ο0 and Ο0-FAST: Vision-Language-Action Models for General Robot Control 8 days ago β’ 92
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper β’ 2501.18512 β’ Published 12 days ago β’ 25
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 10 items β’ Updated 1 day ago β’ 90
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper β’ 2501.12948 β’ Published 20 days ago β’ 315
view article Article Hugging Face and FriendliAI partner to supercharge model deployment on the Hub 21 days ago β’ 30
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others β’ 22 days ago β’ 33
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper β’ 2501.03262 β’ Published Jan 4 β’ 90
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper β’ 2501.04682 β’ Published Jan 8 β’ 89
TACO Models Collection This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. β’ 3 items β’ Updated Dec 20, 2024 β’ 8