Submitted by akhaliq 48 Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models · 20 authors 2
Submitted by akhaliq 26 Instruction-tuned Language Models are Better Knowledge Learners · 9 authors 1
Submitted by akhaliq 24 VideoPrism: A Foundational Visual Encoder for Video Understanding · 19 authors 2
Submitted by akhaliq 21 The FinBen: An Holistic Financial Benchmark for Large Language Models · 34 authors 5
Submitted by akhaliq 19 Improving Robustness for Joint Optimization of Camera Poses and Decomposed Low-Rank Tensorial Radiance Fields · 3 authors 1
Submitted by akhaliq 18 MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction · 9 authors 4
Submitted by akhaliq 15 A Touch, Vision, and Language Dataset for Multimodal Alignment · 10 authors 1
Submitted by akhaliq 14 How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompts · 4 authors 3
Submitted by akhaliq 12 TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization · 14 authors 4
Submitted by akhaliq 9 RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models · 10 authors 1