base_model: unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit | |
tags: | |
- text-generation-inference | |
- transformers | |
- unsloth | |
- llama | |
- trl | |
- prashasst | |
license: apache-2.0 | |
language: | |
- en | |
datasets: | |
- FreedomIntelligence/medical-o1-reasoning-SFT | |
pipeline_tag: text-generation | |
library_name: peft | |
# Uploaded model | |
- **Developed by:** Prashasst | |
- **License:** apache-2.0 | |
- **Finetuned from model :** unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit | |