InfoCTM: A Mutual Information Maximization Perspective of Cross-Lingual Topic Modeling Paper • 2304.03544 • Published Apr 7, 2023
Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models Paper • 2403.10258 • Published Mar 15, 2024
M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework Paper • 2411.06176 • Published Nov 9, 2024 • 45
SeaExam and SeaBench: Benchmarking LLMs with Local Multilingual Questions in Southeast Asia Paper • 2502.06298 • Published Feb 10 • 1
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published 14 days ago • 58
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving Paper • 2502.20238 • Published 17 days ago • 24
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving Paper • 2502.20238 • Published 17 days ago • 24
Efficient Diffusion Model for Image Restoration by Residual Shifting Paper • 2403.07319 • Published Mar 12, 2024
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration Paper • 2501.01320 • Published Jan 2 • 11
LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations Paper • 2412.08580 • Published Dec 11, 2024 • 45
MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era Paper • 2406.09121 • Published Jun 13, 2024 • 1
AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention Paper • 2406.12718 • Published Jun 18, 2024 • 1
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model Paper • 2402.03631 • Published Feb 6, 2024
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio Paper • 2410.12787 • Published Oct 16, 2024 • 31
Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining Paper • 2401.08407 • Published Jan 16, 2024
Domain Adaptive Video Segmentation via Temporal Pseudo Supervision Paper • 2207.02372 • Published Jul 6, 2022
Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation Paper • 2309.13505 • Published Sep 24, 2023
Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation Paper • 2309.13505 • Published Sep 24, 2023
Mitigating Object Hallucination via Concentric Causal Attention Paper • 2410.15926 • Published Oct 21, 2024 • 17