Pinkstack's picture
Update README.md
d9fe9bd verified
|
raw
history blame
1.02 kB
metadata
license: mit
license_link: https://huggingface.co/microsoft/phi-4/resolve/main/LICENSE
language:
  - en
pipeline_tag: text-generation
tags:
  - phi
  - nlp
  - math
  - code
  - chat
  - conversational
  - phi3
inference:
  parameters:
    temperature: 0
widget:
  - messages:
      - role: user
        content: How many R's in strawberry? Think step by step.
library_name: transformers

gguf/final version: https://huggingface.co/Pinkstack/PARM-V2-phi-4-16k-CoT-o1-gguf

Phi-4 Technical Report Phi-4 that has been tuned to be more advanced at reasoning. Parm magic 😉

Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. Training loss: 0.443800

NOTE: more information soon, gguf

Uploaded model

  • Developed by: Pinkstack
  • License: MIT
  • Finetuned from model : microsoft/phi-4

This phi-4 model was trained with Unsloth and Huggingface's TRL library.