DontPlanToEnd commited on
Commit
a43dab5
·
verified ·
1 Parent(s): 137b3a2

Upload ugi-leaderboard-data.csv

Browse files
Files changed (1) hide show
  1. ugi-leaderboard-data.csv +12 -0
ugi-leaderboard-data.csv CHANGED
@@ -708,3 +708,15 @@ soob3123/Veritas-12B,https://huggingface.co/soob3123/Veritas-12B,4/22/2025,4/26/
708
  zelk12/MT-Gen13-gemma-2-9B,https://huggingface.co/zelk12/MT-Gen13-gemma-2-9B,4/26/2025,4/26/2025,gemma-2,9.0,9.0,9.0,True,True,False,30.14,7.0,6.0,8.0,12.89,8,2.8,2.0,2.5,-14.1%,57.9%,50.0%,45.3%,62.9%,40.6%,65.2%,55.8%,45.0%,42.7%,38.5%,43.8%,50.4%,41.7%,59.2%,66.0%,63.3%,Liberalism
709
  zelk12/MT1-Gen13-gemma-2-9B,https://huggingface.co/zelk12/MT1-Gen13-gemma-2-9B,4/26/2025,4/26/2025,gemma-2,9.0,9.0,9.0,True,True,False,31.97,8.0,6.0,10.0,17.7,8,2.2,2.5,2.3,-13.0%,59.9%,52.2%,40.0%,61.6%,39.4%,61.7%,57.7%,42.3%,39.8%,38.3%,41.7%,41.7%,36.7%,62.1%,62.3%,60.4%,Liberalism
710
  cognitivecomputations/Dolphin3.0-Mistral-24B,https://huggingface.co/cognitivecomputations/Dolphin3.0-Mistral-24B,2/2/2025,4/26/2025,chatml,24.0,24.0,24.0,True,False,False,31.43,6.5,7.0,6.0,24.26,14,4.4,1.9,3.4,-24.7%,74.2%,48.3%,45.9%,63.4%,35.8%,66.5%,47.3%,20.8%,35.8%,20.6%,48.3%,53.1%,36.2%,60.2%,58.5%,71.5%,Liberalism
 
 
 
 
 
 
 
 
 
 
 
 
 
708
  zelk12/MT-Gen13-gemma-2-9B,https://huggingface.co/zelk12/MT-Gen13-gemma-2-9B,4/26/2025,4/26/2025,gemma-2,9.0,9.0,9.0,True,True,False,30.14,7.0,6.0,8.0,12.89,8,2.8,2.0,2.5,-14.1%,57.9%,50.0%,45.3%,62.9%,40.6%,65.2%,55.8%,45.0%,42.7%,38.5%,43.8%,50.4%,41.7%,59.2%,66.0%,63.3%,Liberalism
709
  zelk12/MT1-Gen13-gemma-2-9B,https://huggingface.co/zelk12/MT1-Gen13-gemma-2-9B,4/26/2025,4/26/2025,gemma-2,9.0,9.0,9.0,True,True,False,31.97,8.0,6.0,10.0,17.7,8,2.2,2.5,2.3,-13.0%,59.9%,52.2%,40.0%,61.6%,39.4%,61.7%,57.7%,42.3%,39.8%,38.3%,41.7%,41.7%,36.7%,62.1%,62.3%,60.4%,Liberalism
710
  cognitivecomputations/Dolphin3.0-Mistral-24B,https://huggingface.co/cognitivecomputations/Dolphin3.0-Mistral-24B,2/2/2025,4/26/2025,chatml,24.0,24.0,24.0,True,False,False,31.43,6.5,7.0,6.0,24.26,14,4.4,1.9,3.4,-24.7%,74.2%,48.3%,45.9%,63.4%,35.8%,66.5%,47.3%,20.8%,35.8%,20.6%,48.3%,53.1%,36.2%,60.2%,58.5%,71.5%,Liberalism
711
+ Qwen/Qwen3-32B (thinking=disabled),https://huggingface.co/Qwen/Qwen3-32B (thinking=disabled),4/29/2025,4/30/2025,chatml,32.0,32.0,32.0,False,False,True,26.54,3.0,3.0,3.0,19.39,24,3.6,3.0,3.1,-18.2%,64.4%,48.9%,47.6%,60.9%,39.8%,60.2%,46.7%,34.4%,37.9%,34.6%,54.0%,53.1%,35.6%,57.3%,60.4%,65.0%,Liberalism
712
+ yamatazen/Gemma2-Snowflakes-9B,https://huggingface.co/yamatazen/Gemma2-Snowflakes-9B,4/26/2025,4/30/2025,gemma-2,9.0,9.0,9.0,True,True,False,31.55,6.5,7.0,6.0,16.94,4,3.8,2.2,3.1,7.3%,45.6%,51.5%,41.7%,53.9%,34.8%,67.7%,57.1%,60.2%,40.8%,62.3%,34.8%,38.3%,52.1%,57.7%,44.8%,59.2%,Centrism
713
+ yamatazen/HMS-Slerp-12B-v2,https://huggingface.co/yamatazen/HMS-Slerp-12B-v2,4/27/2025,4/30/2025,chatml,12.0,12.0,12.0,True,True,False,31.58,7.5,7.0,8.0,20.54,20,3.2,2.0,3.2,-15.6%,62.9%,43.6%,46.5%,61.3%,51.5%,63.8%,46.0%,39.6%,34.6%,37.1%,53.1%,50.2%,36.2%,58.3%,60.4%,65.2%,Liberalism
714
+ DoppelReflEx/MiniusLight-24B-v2.1 (chatml),https://huggingface.co/DoppelReflEx/MiniusLight-24B-v2.1,4/27/2025,4/30/2025,chatml,24.0,24.0,24.0,True,True,False,34.65,5.0,4.0,6.0,33.4,20,4.0,3.7,3.1,-13.7%,65.3%,43.7%,36.2%,65.1%,50.6%,62.1%,43.8%,35.8%,35.0%,33.3%,35.6%,39.6%,33.3%,69.6%,64.0%,61.7%,Liberalism
715
+ DoppelReflEx/MiniusLight-24B-v2.1 (mistral - V7-Tekken),https://huggingface.co/DoppelReflEx/MiniusLight-24B-v2.1,4/27/2025,4/30/2025,mistral - V7-Tekken,24.0,24.0,24.0,True,True,False,29.35,4.5,4.0,5.0,30.83,20,3.4,3.4,2.2,-14.3%,66.1%,45.5%,40.4%,57.6%,45.2%,66.9%,48.5%,33.8%,36.0%,31.9%,42.9%,47.1%,31.2%,51.0%,63.5%,58.1%,Liberalism
716
+ Qwen/Qwen3-8B (thinking=disabled),https://huggingface.co/Qwen/Qwen3-8B (thinking=disabled),4/29/2025,4/30/2025,chatml,8.0,8.0,8.0,False,False,True,26.28,7.0,5.0,9.0,14.28,8,2.8,1.9,1.6,-14.4%,61.1%,48.8%,40.4%,59.0%,42.5%,62.5%,51.2%,39.0%,39.0%,38.8%,47.5%,45.2%,28.5%,52.7%,57.7%,66.5%,Liberalism
717
+ Qwen/Qwen3-4B (thinking=disabled),https://huggingface.co/Qwen/Qwen3-4B (thinking=disabled),4/29/2025,4/30/2025,chatml,4.0,4.0,4.0,False,False,True,21.82,6.0,5.0,7.0,9.79,4,2.5,1.2,2.0,-13.5%,64.0%,49.3%,48.6%,55.1%,45.4%,61.5%,54.8%,32.7%,48.3%,27.1%,48.8%,49.4%,47.7%,40.0%,58.1%,67.1%,Liberalism
718
+ Qwen/Qwen3-1.7B (thinking=disabled),https://huggingface.co/Qwen/Qwen3-1.7B (thinking=disabled),4/29/2025,4/30/2025,chatml,1.7,1.7,1.7,False,False,True,20.34,7.0,5.0,9.0,7.0,0,1.9,0.8,0.9,-22.9%,64.6%,64.0%,62.3%,59.5%,32.5%,31.7%,56.2%,18.3%,50.8%,37.1%,65.8%,64.4%,56.7%,45.8%,65.8%,66.9%,Technocracy
719
+ Qwen/Qwen3-0.6B (thinking=disabled),https://huggingface.co/Qwen/Qwen3-0.6B (thinking=disabled),4/29/2025,4/30/2025,chatml,0.6,0.6,0.6,False,False,True,19.28,6.5,5.0,8.0,3.42,0,1.2,1.0,1.3,-14.5%,64.0%,44.1%,53.1%,57.1%,39.6%,67.7%,39.6%,31.0%,37.1%,39.8%,53.5%,63.8%,42.1%,48.3%,50.2%,72.9%,Liberalism
720
+ cognitivecomputations/Dolphin-Mistral-24B-Venice-Edition,https://huggingface.co/cognitivecomputations/Dolphin-Mistral-24B-Venice-Edition,4/16/2025,4/30/2025,mistral - V7-Tekken,24.0,24.0,24.0,True,False,False,36.83,8.5,9.0,8.0,24.19,24,2.8,3.0,3.4,-15.8%,57.9%,47.3%,44.4%,60.3%,46.2%,62.5%,50.6%,39.8%,50.6%,36.0%,52.7%,46.7%,33.8%,52.5%,60.2%,68.1%,Liberalism
721
+ darkc0de/XortronCriminalComputing,https://huggingface.co/darkc0de/XortronCriminalComputing,4/30/2025,4/30/2025,mistral - V7-Tekken,24.0,24.0,24.0,True,False,False,36.83,8.5,9.0,8.0,24.19,24,2.8,3.0,3.4,-15.8%,57.9%,47.3%,44.4%,60.3%,46.2%,62.5%,50.6%,39.8%,50.6%,36.0%,52.7%,46.7%,33.8%,52.5%,60.2%,68.1%,Liberalism
722
+ allenai/OLMo-2-0325-32B-Instruct,https://huggingface.co/allenai/OLMo-2-0325-32B-Instruct,3/12/2025,4/30/2025,OLMo-2,32.0,32.0,32.0,False,False,True,27.36,2.0,4.0,0.0,30.38,8,2.5,2.9,4.9,-20.9%,64.2%,46.7%,43.6%,66.0%,47.5%,64.0%,51.5%,37.3%,37.1%,32.9%,45.4%,52.3%,33.1%,67.7%,60.8%,69.6%,Liberalism