12.4k
Open LLM Leaderboard
π
Track, rank and evaluate open LLMs and chatbots
Evaluate models on key benchmarks. Thanks @clefourrier and @VictorSanh for the recommandations.
Track, rank and evaluate open LLMs and chatbots
Submit code models for evaluation on benchmarks
VLMEvalKit Evaluation Results Collection
Analyze images to detect and label objects
Vote on the latest TTS models!
Generate a 3D leaderboard by voting
Select and filter benchmarks for text embedding tasks
GIFT-Eval: A Benchmark for General Time Series Forecasting
Blind vote on HF TTS models!
Vote on background-removed images to rank models