On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective Paper • 2502.14296 • Published 3 days ago • 41
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published 20 days ago • 37
Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment Paper • 2411.17188 • Published Nov 26, 2024 • 22