Spanish and LLM Benchmarks: is MMLU Lost in Translation? Paper • 2406.17789 • Published May 28, 2024 • 1
How Stable is Stable Diffusion under Recursive InPainting (RIP)? Paper • 2407.09549 • Published Jun 27, 2024 • 1
Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal Paper • 2408.16012 • Published Aug 16, 2024
Evaluating Large Language Models with Tests of Spanish as a Foreign Language: Pass or Fail? Paper • 2409.15334 • Published Sep 8, 2024 • 1
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong Paper • 2501.09775 • Published 27 days ago • 28
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong Paper • 2501.09775 • Published 27 days ago • 28