End of training
Browse files
README.md
CHANGED
@@ -1,4 +1,5 @@
|
|
1 |
---
|
|
|
2 |
library_name: transformers
|
3 |
model_name: R2-Q7B-GR1-ALL-s1k-5e-5-weight-decay-1e-4
|
4 |
tags:
|
@@ -10,7 +11,7 @@ licence: license
|
|
10 |
|
11 |
# Model Card for R2-Q7B-GR1-ALL-s1k-5e-5-weight-decay-1e-4
|
12 |
|
13 |
-
This model is a fine-tuned version of [None](https://huggingface.co/None).
|
14 |
It has been trained using [TRL](https://github.com/huggingface/trl).
|
15 |
|
16 |
## Quick start
|
|
|
1 |
---
|
2 |
+
datasets: open-r1/numina-ALL-V4-verify-s1k
|
3 |
library_name: transformers
|
4 |
model_name: R2-Q7B-GR1-ALL-s1k-5e-5-weight-decay-1e-4
|
5 |
tags:
|
|
|
11 |
|
12 |
# Model Card for R2-Q7B-GR1-ALL-s1k-5e-5-weight-decay-1e-4
|
13 |
|
14 |
+
This model is a fine-tuned version of [None](https://huggingface.co/None) on the [open-r1/numina-ALL-V4-verify-s1k](https://huggingface.co/datasets/open-r1/numina-ALL-V4-verify-s1k) dataset.
|
15 |
It has been trained using [TRL](https://github.com/huggingface/trl).
|
16 |
|
17 |
## Quick start
|