tttx/ttt-problem10-32b-021025-sl25000

alignment-handbook

Generated from Trainer


aadityap committed on Feb 11

Commit f134e72 (verified) · 1 Parent(s): 9a7e1ab

End of training

Files changed (1)

README.md +6 -1


@@ -3,9 +3,14 @@ library_name: peft
 license: mit
 base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
 tags:
+- alignment-handbook
 - trl
 - sft
 - generated_from_trainer
+datasets:
+- tttx/star-run-step1-master
+- tttx/ttt-problem-10-continue-step1-master
+- tttx/ttt-problem-10-continue-step2-master
 model-index:
 - name: ttt-problem10-32b-021025-sl25000
   results: []
@@ -16,7 +21,7 @@ should probably proofread and complete it, then remove this comment. -->
 # ttt-problem10-32b-021025-sl25000
-This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) on the None dataset.
+This model is a fine-tuned version of [tttx/sft-32b-020925-19k-5ep](https://huggingface.co/tttx/sft-32b-020925-19k-5ep) on the tttx/star-run-step1-master, the tttx/ttt-problem-10-continue-step1-master and the tttx/ttt-problem-10-continue-step2-master datasets.
 ## Model description