Text Generation
Transformers
Safetensors
English
qwen2
conversational
text-generation-inference

Model Card for Model ID

Model trained based on deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B by Self-Calibration proposed by Efficient Test-Time Scaling via Self-Calibration.

Model Sources

Citation

BibTeX:

@misc{huang2025efficienttesttimescalingselfcalibration,
      title={Efficient Test-Time Scaling via Self-Calibration}, 
      author={Chengsong Huang and Langlin Huang and Jixuan Leng and Jiacheng Liu and Jiaxin Huang},
      year={2025},
      eprint={2503.00031},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2503.00031}, 
}

Model Card Contact

[email protected]

Downloads last month
121
Safetensors
Model size
1.78B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for HINT-lab/DeepSeek-R1-Distill-Qwen-1.5B-Self-Calibration

Finetuned
(207)
this model
Quantizations
1 model

Dataset used to train HINT-lab/DeepSeek-R1-Distill-Qwen-1.5B-Self-Calibration

Collection including HINT-lab/DeepSeek-R1-Distill-Qwen-1.5B-Self-Calibration