Vedant101's picture
Upload fine-tuned medical DeepSeek model
9f698c5 verified
---
language: en
tags:
- medical
- deepseek
- fine-tuned
- clinical-reasoning
license: apache-2.0
datasets:
- FreedomIntelligence/medical-o1-reasoning-SFT
---
# Fine-tuned DeepSeek R1 Model for Medical Reasoning
This model is a fine-tuned version of DeepSeek R1 specialized for medical reasoning and clinical decision-making.
## Training Details
- Base Model: unsloth/DeepSeek-R1-Distill-Llama-8B
- Training Data: Medical reasoning dataset (FreedomIntelligence/medical-o1-reasoning-SFT)
- Fine-tuning Method: LoRA (Low-Rank Adaptation)
- Training Parameters:
- Batch Size: 2
- Learning Rate: 2e-4
- Epochs: 1
- Max Sequence Length: 2048
## Usage
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("Vedant101/fine-tune-deep-seek-r1")
tokenizer = AutoTokenizer.from_pretrained("Vedant101/fine-tune-deep-seek-r1")
```