0xZee's picture
Trained with Unsloth
219a3ce verified
metadata
license: apache-2.0
base_model:
  - deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
datasets:
  - 0xZee/dataset-CoT-Space-Physics-Astrophysics-76
tags:
  - unsloth
  - trl
  - sft

Finetunning deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B on 0xZee/dataset-CoT-Physics-Astrophysics