Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries Paper • 2502.20475 • Published 14 days ago • 2 • 4
An Empirical Study on Eliciting and Improving R1-like Reasoning Models Paper • 2503.04548 • Published 7 days ago • 8 • 3
Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer Paper • 2503.02495 • Published 9 days ago • 8 • 4
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion Paper • 2503.04222 • Published 7 days ago • 13 • 3
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published 21 days ago • 85 • 9
ActionPiece: Contextually Tokenizing Action Sequences for Generative Recommendation Paper • 2502.13581 • Published 22 days ago • 5 • 3
Large Language Models and Mathematical Reasoning Failures Paper • 2502.11574 • Published 24 days ago • 3 • 3
We Can't Understand AI Using our Existing Vocabulary Paper • 2502.07586 • Published about 1 month ago • 10 • 4
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning Paper • 2502.06060 • Published Feb 9 • 34 • 3
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations Paper • 2502.05003 • Published Feb 7 • 43 • 3
Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression Paper • 2502.04296 • Published Feb 6 • 6 • 3
Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach Paper • 2502.03639 • Published Feb 5 • 9 • 3
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published Feb 3 • 9 • 6
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Paper • 2501.18119 • Published Jan 30 • 25 • 3
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Paper • 2501.18837 • Published Jan 31 • 10 • 5
Unraveling the Capabilities of Language Models in News Summarization Paper • 2501.18128 • Published Jan 30 • 4 • 3
PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding Paper • 2501.16411 • Published Jan 27 • 18 • 3