UW-Madison-Lee-Lab
/

VersaPRM-Math-Subset

Generated from Trainer

Model card Files Files and versions Community

UW-Madison-Lee-Lab commited on 3 days ago

Commit

1cba450

·

verified ·

1 Parent(s): 5218bad

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 # VersaPRM-Math-Subset
-This model is a fine-tuned version of [UW-Madison-Lee-Lab/Llama-PRM800K](https://huggingface.co/UW-Madison-Lee-Lab/Llama-PRM800K) on the math category subset of [UW-Madison-Lee-Lab/MMLU-Pro-CoT-Train-Labeled](https://huggingface.co/datasets/UW-Madison-Lee-Lab/MMLU-Pro-CoT-Train-Labeled).
 ## Get rewards
 ```python

 # VersaPRM-Math-Subset
+This model is a fine-tuned version of [UW-Madison-Lee-Lab/Llama-PRM800K](https://huggingface.co/UW-Madison-Lee-Lab/Llama-PRM800K) on the __math category subset__ of [UW-Madison-Lee-Lab/MMLU-Pro-CoT-Train-Labeled](https://huggingface.co/datasets/UW-Madison-Lee-Lab/MMLU-Pro-CoT-Train-Labeled).
 ## Get rewards
 ```python