Leaderboards 🔥
A collection of Leaderboards for LLMs ⚡️⚖️ 🤗
- 3.99k🏆
- 12.4k
Open LLM Leaderboard
🏆Track, rank and evaluate open LLMs and chatbots
- 185
Yet Another LLM Leaderboard
🌖Run a Streamlit web app
- 130
Hallucinations Leaderboard
🔥View and submit LLM evaluations
- 419
LLM-Perf Leaderboard
🏆Explore hardware performance for language models
- 88
LLM Safety Leaderboard
🥇View and submit machine learning model evaluations
- 222
AI2 WildBench Leaderboard (V2)
🦁Display and explore model leaderboards and chat history
- 30
Contextual Leaderboard
🐨
- 4.74k
MTEB Leaderboard
🥇Select and filter benchmarks for text embedding tasks
- 50
Open CoT Leaderboard
🥇Track, rank and evaluate open LLMs' CoT quality
- 276
LLM Performance Leaderboard
🐨View LLM Performance Leaderboard
- 181
BigCodeBench Leaderboard
🥇Explore and analyze code evaluation data
- 57
The timm Leaderboard
🏆Display and analyze PyTorch Image Models leaderboard
- 60
Open FinLLM Leaderboard
🥇Browse and submit large language model evaluations
- 99
Open VLM Video Leaderboard
🌎VLMEvalKit Eval Results in video understanding benchmark
- 38
MEGA-Bench Leaderboard
🥇A leaderboard for multimodal models
- 84
Open LLM Leaderboard Model Comparator
🏆Compare Open LLM Leaderboard results
- 108
Vidore Leaderboard
🥇Display Visual Document Retrieval leaderboard
- 88
Judge Arena
💻Compare AI models by voting on responses
- 605
Open VLM Leaderboard
🌎VLMEvalKit Evaluation Results Collection
- 8
Keras Chatbot Battle
💬Interact with multiple chatbots simultaneously
- 4
OmniEval
🥇
- 5
OmniEval
🥇Official Leaderboard for OmniEval
open-llm-leaderboard/contents
Viewer • Updated • 3.83k • 18.5k • 14
- 62
LeaderboardExplorer
🔎Filter and display leaderboards based on selected criteria
- 262
GAIA Leaderboard
🦾Submit and evaluate models on a leaderboard
m-ric/agents_small_benchmark
Viewer • Updated • 100 • 78 • 10
- 287
TTS Spaces Arena
🤗Blind vote on HF TTS models!
- 92
MTEB Arena
⚔Teach, test, evaluate language models with MTEB Arena
- 267
GenAI Arena
📈Realtime Image/Video Gen AI Arena