Submitted by akhaliq 55 DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search · 17 authors 3
Submitted by zhangysk 35 I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm · 12 authors 2
Submitted by Zigeng 19 Heavy Labels Out! Dataset Distillation with Label Space Lightening · 5 authors 2
Submitted by akhaliq 17 FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance · 7 authors 3
Submitted by akhaliq 16 Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability · 31 authors 2
Submitted by akhaliq 13 BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts · 11 authors 3
Submitted by IAMJB 11 The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community · 3 authors 1
Submitted by akhaliq 11 Accelerating High-Fidelity Waveform Generation via Adversarial Flow Matching Optimization · 3 authors 4
Submitted by akhaliq 9 MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing · 5 authors 2