Next Block Prediction: Video Generation via Semi-Autoregressive Modeling Paper • 2502.07737 • Published 3 days ago • 7
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 4 days ago • 117
view article Article Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset By sdiazlor • 4 days ago • 27
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 23 days ago • 318
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling Paper • 2501.17811 • Published 16 days ago • 4
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 8 days ago • 48