Update README.md
README.md
CHANGED
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 # VersaPRM-Math-Subset
 
-This model is a fine-tuned version of [UW-Madison-Lee-Lab/Llama-PRM800K](https://huggingface.co/UW-Madison-Lee-Lab/Llama-PRM800K) on the
+This model is a fine-tuned version of [UW-Madison-Lee-Lab/Llama-PRM800K](https://huggingface.co/UW-Madison-Lee-Lab/Llama-PRM800K) on the __math category subset__ of [UW-Madison-Lee-Lab/MMLU-Pro-CoT-Train-Labeled](https://huggingface.co/datasets/UW-Madison-Lee-Lab/MMLU-Pro-CoT-Train-Labeled).
 
 ## Get rewards
 ```python