tttx
/

manual-ttt-problem10-32b-021025-2

alignment-handbook

Generated from Trainer

Model card Files Files and versions Community

aadityap commited on 1 day ago

Commit

a86748c

·

verified ·

1 Parent(s): 86818ca

End of training

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -1,11 +1,13 @@
 ---
 library_name: peft
-license: mit
 base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
 tags:
 - trl
 - sft
 - generated_from_trainer
 model-index:
 - name: manual-ttt-problem10-32b-021025-2
   results: []
@@ -16,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 # manual-ttt-problem10-32b-021025-2
-This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) on an unknown dataset.
 ## Model description

 ---
 library_name: peft
 base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
 tags:
+- alignment-handbook
 - trl
 - sft
 - generated_from_trainer
+datasets:
+- tttx/ttt-problem-10-continue-step2-master
 model-index:
 - name: manual-ttt-problem10-32b-021025-2
   results: []
 # manual-ttt-problem10-32b-021025-2
+This model is a fine-tuned version of [tttx/models-ttt-problem-10-continue-step1](https://huggingface.co/tttx/models-ttt-problem-10-continue-step1) on the tttx/ttt-problem-10-continue-step2-master dataset.
 ## Model description