tttx
/

ttt-problem10-32b-021025-sl25000

alignment-handbook

Generated from Trainer

Model card Files Files and versions Community

aadityap commited on Feb 11

Commit

7491a01

·

verified ·

1 Parent(s): 8567d52

End of training

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -3,9 +3,12 @@ library_name: peft
 license: mit
 base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
 tags:
 - trl
 - sft
 - generated_from_trainer
 model-index:
 - name: ttt-problem10-32b-021025-sl25000
   results: []
@@ -16,7 +19,7 @@ should probably proofread and complete it, then remove this comment. -->
 # ttt-problem10-32b-021025-sl25000
-This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) on an unknown dataset.
 ## Model description

 license: mit
 base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
 tags:
+- alignment-handbook
 - trl
 - sft
 - generated_from_trainer
+datasets:
+- tttx/p10-ttt-overnight-3-step2-collated
 model-index:
 - name: ttt-problem10-32b-021025-sl25000
   results: []
 # ttt-problem10-32b-021025-sl25000
+This model is a fine-tuned version of [tttx/models-p10-ttt-overnight-3-step1](https://huggingface.co/tttx/models-p10-ttt-overnight-3-step1) on the tttx/p10-ttt-overnight-3-step2-collated dataset.
 ## Model description