Submitted by ofantomas 99 $\nabla^2$DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials · 13 authors 4
Submitted by daixuancheng 88 Instruction Pre-Training: Language Models are Supervised Multitask Learners · 6 authors 25
Submitted by ai-alanov 66 The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing · 4 authors 2
Submitted by Lin-Chen 35 Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs · 9 authors 2
Submitted by KennyUTC 33 MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding · 7 authors 1
Submitted by hammh0a 30 Model Merging and Safety Alignment: One Bad Model Spoils the Bunch · 7 authors 1
Submitted by sachit-menon 28 Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities · 3 authors 1
Submitted by dbaranchuk 27 Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps · 4 authors 1
Submitted by whitemetalicdragon 24 GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks · 2 authors 3
Submitted by zhangysk 23 PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents · 16 authors 1
Submitted by Jiayi-Pan 20 DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning · 7 authors 1
Submitted by davanstrien 16 Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models · 7 authors 2
Submitted by ChuangtaoChen-TUM 14 LiveMind: Low-latency Large Language Models with Simultaneous Inference · 6 authors 4
Submitted by GuyYariv 13 Improving Visual Commonsense in Language Models via Multiple Image Generation · 4 authors 2
Submitted by zhangysk 13 Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level · 7 authors 1
Submitted by bdqnghi 11 REPOEXEC: Evaluate Code Generation with a Repository-Level Executable Benchmark · 3 authors 1
Submitted by akhaliq 10 ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning · 5 authors 3
Submitted by gsarti 7 Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation · 4 authors 1
Submitted by sohampnow 7 $τ$-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains · 4 authors 1
Submitted by hpzhang 5 A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models · 3 authors 2
Submitted by aluo-x 5 StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images · 6 authors 1
Submitted by dippedrusk 5 From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP · 5 authors 1
Submitted by pmh47 4 Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models · 4 authors 1