dbands
/

Qwen2.5-Coder-14B-Instruct-reason-gguf

text-generation-inference

Model card Files Files and versions Community

dbands commited on Feb 9

Commit

6fe960e

·

verified ·

1 Parent(s): 2727b9a

Update README.md

Files changed (1) hide show

README.md +29 -1

README.md CHANGED Viewed

@@ -9,6 +9,34 @@ tags:
 license: apache-2.0
 language:
 - en
 ---
 # Uploaded  model
@@ -19,4 +47,4 @@ language:
 This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 license: apache-2.0
 language:
 - en
+datasets:
+- openai/gsm8k
+---
+# My Reasoning Model
+## System Prompt Format
+Respond in the following format:
+```
+<reasoning>
+...
+</reasoning>
+<answer>
+...
+</answer>
+```
+I fine-tuned the model using `openai/gsm8k`, and to ensure costs do not go insane, I used a single A100.
+```
+Enjoy, but please note that this model is experimental and I used it to define my pipeline.
+I will be testing fine tuning larger more capable models.  I suspect they would add more value in the short term.
 ---
 # Uploaded  model
 This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)