IQuestLab/IQuest-Coder-V1-40B-Instruct Text Generation • 40B • Updated about 21 hours ago • 1.89k • 174
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models Paper • 2504.11468 • Published Apr 10, 2025 • 30
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28, 2025 • 123
Cosmos-Predict2 Collection World Foundation Model for Future Prediction • 13 items • Updated 11 days ago • 33