metadata
license: mit
license_link: https://huggingface.co/microsoft/phi-4/resolve/main/LICENSE
language:
- en
pipeline_tag: text-generation
tags:
- phi
- nlp
- math
- code
- chat
- conversational
- phi3
inference:
parameters:
temperature: 0
widget:
- messages:
- role: user
content: How many R's in strawberry? Think step by step.
library_name: transformers
gguf/final version: https://huggingface.co/Pinkstack/PARM-V2-phi-4-16k-CoT-o1-gguf
Phi-4 Technical Report Phi-4 that has been tuned to be more advanced at reasoning. Parm magic 😉
Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. Training loss: 0.443800
NOTE: more information soon, gguf
Uploaded model
- Developed by: Pinkstack
- License: MIT
- Finetuned from model : microsoft/phi-4
This phi-4 model was trained with Unsloth and Huggingface's TRL library.