Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 15 days ago • 106
Running 2.23k 2.23k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published 27 days ago • 51
Congliu/Chinese-DeepSeek-R1-Distill-data-110k Viewer • Updated 21 days ago • 110k • 7.74k • 521