Submitted by akhaliq 55 Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models · 4 authors 6
Submitted by akhaliq 46 ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models · 9 authors 7
Submitted by akhaliq 40 Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization · 4 authors 1
Submitted by akhaliq 28 Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training · 8 authors 1
Submitted by akhaliq 27 AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct · 3 authors 9
Submitted by akhaliq 19 CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner · 7 authors 2
Submitted by akhaliq 17 Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach · 13 authors
Submitted by akhaliq 16 Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition · 6 authors
Submitted by akhaliq 15 Data Mixing Made Efficient: A Bivariate Scaling Law for Language Model Pretraining · 5 authors
Submitted by akhaliq 8 HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting · 7 authors