Commit 8f9396c (1 parent: bd9c702)
updating readme
README.md CHANGED
@@ -10,12 +10,6 @@ pinned: false
 license: cc-by-4.0
 ---
 
-## Overview
-
-This application provides a visual leaderboard for comparing AI model performance on challenging Machine Learning Research Competition problems. It uses Streamlit to create an interactive web interface with filtering options, allowing users to select specific models and tasks for comparison.
-
-The leaderboard uses the MLRC-BENCH benchmark, which measures what percentage of the top human-to-baseline performance gap an agent can close. Success is defined as achieving at least 5% of the margin by which the top human solution surpasses the baseline.
-
 ## Installation & Setup
 
 1. Clone the repository
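
The removed Overview text describes the MLRC-BENCH score as the percentage of the top human-to-baseline gap that an agent closes, with success at 5% or more. A minimal sketch of that arithmetic follows. The function names, the argument names, and the assumption that higher scores are better are illustrative and do not come from the repository.

```python
def margin_closed_pct(agent: float, baseline: float, top_human: float) -> float:
    """Percentage of the top-human-to-baseline gap closed by the agent.

    Hypothetical helper: assumes higher scores are better and that the
    top human solution scores strictly above the baseline.
    """
    gap = top_human - baseline
    if gap <= 0:
        raise ValueError("top_human must exceed baseline")
    return 100.0 * (agent - baseline) / gap


def is_success(pct: float, threshold: float = 5.0) -> bool:
    """Success as described in the README text: at least 5% of the margin."""
    return pct >= threshold


# Illustrative numbers only, not benchmark results.
pct = margin_closed_pct(agent=55.0, baseline=50.0, top_human=70.0)
print(pct, is_success(pct))
```

With these made-up inputs the agent closes 5 of the 20 points separating the baseline from the top human score, so it clears the 5% threshold.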