Model Name,Overall Accuracy,Correct Predictions,Total Questions,Timestamp,Team Name Nemotron-Mini-4B-Instruct,35.1,5681,16186,2025-03-26 19:00:45, Llama-3.2-1B-Instruct,34.5,5584,16186,2025-03-26 19:00:45, Gemma-2-2b-it,38.9,6296,16186,2025-03-26 19:00:45, Falcon3-3B-Instruct,42.6,6895,16186,2025-03-26 19:00:45, Granite-3.1-3b-a800m-instruct,43.6,7057,16186,2025-03-26 19:00:45, Llama-3.2-3B-Instruct,50.2,8125,16186,2025-03-26 19:00:45, Granite-3.1-2b-instruct,48.1,7785,16186,2025-03-26 19:00:45, Qwen2.5-1.5B-Instruct,49.7,8044,16186,2025-03-26 19:00:45, Exaone-3.5-2.4B-Instruct,53.7,8692,16186,2025-03-26 19:00:45, Phi-3.5-mini-instruct,63.7,10310,16186,2025-03-26 19:00:45, Qwen2.5-3B-Instruct,68.1,11023,16186,2025-03-26 19:00:45, Olmo-2-1124-7B-Instruct,49.6,8028,16186,2025-03-26 19:00:45, Falcon3-7B-Instruct,52.4,8481,16186,2025-03-26 19:00:45, Falcon3-10B-Instruct,54.3,8789,16186,2025-03-26 19:00:45, Yi-1.5-6B-Chat,60.5,9793,16186,2025-03-26 19:00:45, Llama-3.1-8B-Instruct,66.9,10828,16186,2025-03-26 19:00:45, Granite-3.1-8b-instruct,60.8,9841,16186,2025-03-26 19:00:45, Internlm2.5-7b-chat,64.3,10408,16186,2025-03-26 19:00:45, Ministral-8B-Instruct-2410,71.5,11573,16186,2025-03-26 19:00:45, Yi-1.5-9B-Chat,72.7,11767,16186,2025-03-26 19:00:45, Qwen2.5-7B-Instruct,74.9,12123,16186,2025-03-26 19:00:45, Gemma-2-9b-it,75.0,12140,16186,2025-03-26 19:00:45,Gemma-2-9b-it