Spaces:
Running
Running
File size: 1,946 Bytes
2926cc4 697ae1d ed2eb44 2926cc4 ed2eb44 17ad9a6 ed2eb44 17ad9a6 ed2eb44 17ad9a6 ed2eb44 17ad9a6 ed2eb44 eea50e2 ed2eb44 eea50e2 ed2eb44 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 |
---
title: MLRC-BENCH
emoji: 📊
colorFrom: green
colorTo: blue
sdk: streamlit
sdk_version: 1.39.0
app_file: app.py
pinned: false
license: cc-by-4.0
---
## Installation & Setup
1. Clone the repository
```bash
git clone https://huggingface.co/spaces/launch/MLRC_Bench
cd MLRC_Bench
```
2. Setup virtual env and install the required dependencies
```bash
python -m venv env
source env/bin/activate
pip install -r requirements.txt
```
3. Run the application
```bash
streamlit run app.py
```
### Updating Metrics
To update the table, update the respective metric file in `src/data/metrics` directory
### Updating Text
To update the tab on Benchmark details, make changes to the the following file - `src/components/tasks.py`
To update the metric definitions, make changes to the following file - `src/components/tasks.py`
### Adding New Metrics
To add a new metric:
1. Create a new JSON data file in the `src/data/metrics/` directory (e.g., `src/data/metrics/new_metric.json`)
2. Update `metrics_config` in `src/utils/config.py`:
```python
metrics_config = {
"Margin to Human": { ... },
"New Metric Name": {
"file": "src/data/metrics/new_metric.json",
"description": "Description of the new metric",
"min_value": 0,
"max_value": 100,
"color_map": "viridis"
}
}
```
3. Ensure your metric JSON file follows the same format as existing metrics:
```json
{
"task-name": {
"model-name-1": value,
"model-name-2": value
},
"another-task": {
"model-name-1": value,
"model-name-2": value
}
}
```
### Adding New Agent Types
To add new agent types:
1. Update `model_categories` in `src/utils/config.py`:
```python
model_categories = {
"Existing Model": "Category",
"New Model Name": "New Category"
}
```
## License
[MIT License](LICENSE)
|