Submitted by akhaliq 136 Training Language Models to Self-Correct via Reinforcement Learning · 18 authors 9
Submitted by akhaliq 48 InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning · 11 authors 4
Submitted by CaraJ 37 MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines · 13 authors 2
Submitted by chenmouxiang 26 B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests · 7 authors 2
Submitted by akhaliq 25 Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution · 6 authors 2
Submitted by akhaliq 24 LVCD: Reference-based Lineart Video Colorization with Diffusion Models · 3 authors 7
Submitted by Ksgk-fy 22 Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization · 8 authors 5
Submitted by akhaliq 19 3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion · 13 authors 2
Submitted by akhaliq 16 StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation · 5 authors 2
Submitted by akoksal 8 MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions · 6 authors 3
Submitted by akhaliq 5 3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt · 4 authors 2
Submitted by akhaliq 5 Denoising Reuse: Exploiting Inter-frame Motion Consistency for Efficient Video Latent Generation · 13 authors 2
Submitted by davidchan 2 CLAIR-A: Leveraging Large Language Models to Judge Audio Captions · 4 authors 2