MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks Paper • 2311.07463 • Published Nov 13, 2023
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? Paper • 2309.07462 • Published Sep 14, 2023