Submitted by djstrong 46 Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation · 5 authors 2
Submitted by QiushiSun 32 AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant · 8 authors 2
Submitted by wanderkid 30 Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction · 10 authors 3
Submitted by akhaliq 23 MarDini: Masked Autoregressive Diffusion for Video Generation at Scale · 15 authors 2
Submitted by Xihc20 19 COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training · 7 authors 5
Submitted by xiaotianhan 19 DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation · 7 authors 3
Submitted by NeoZ123 17 LongReward: Improving Long-context Large Language Models with AI Feedback · 10 authors 2
Submitted by phillipinseoul 14 GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation · 3 authors 2
Submitted by Yiyuan 10 Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines · 4 authors 2
Submitted by hywang66 9 LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior · 5 authors 2
Submitted by cmhungsteve 6 EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation · 12 authors 2
Submitted by ljang0 6 VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks · 8 authors 2
Submitted by raymin0223 6 Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA · 6 authors 3
Submitted by sergioburdisso 5 Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction · 3 authors 2
Submitted by IAMJB 2 Leveraging Locality to Boost Sample Efficiency in Robotic Manipulation · 4 authors 1
Submitted by dnoever 2 Language Models And A Second Opinion Use Case: The Pocket Professional · 1 authors 2