Submitted by lambertxiao 32 Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models · 7 authors 29 1
Submitted by yireun 23 EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes · 41 authors 2
Submitted by yilunzhao 9 Can Multimodal Foundation Models Understand Schematic Diagrams? An Empirical Study on Information-Seeking QA over Scientific Papers · 4 authors 2 1
Submitted by smajumdar94 4 OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique · 9 authors 1
Submitted by mgalkin 3 AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs · 5 authors 6 1
Submitted by Ajwad 2 LLMalMorph: On The Feasibility of Generating Variant Malware using Large-Language-Models · 7 authors 1
Submitted by peiranW 1 UGC-VideoCaptioner: An Omni UGC Video Detail Caption Model and New Benchmarks · 5 authors 1 1
Submitted by SeKim12 1 Taming generative video models for zero-shot optical flow extraction · 11 authors 2 1
Submitted by itay1itzhak 1 Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs · 3 authors 1
Submitted by RanjanSapkota 1 Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning · 4 authors 1
Submitted by cmavro - BYOKG-RAG: Multi-Strategy Graph Retrieval for Knowledge Graph Question Answering · 9 authors 1