Update README.md

README.md
results: []
---

# **About:**

**A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.**

**This model is a fine-tuned version of [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) on the [OpenThoughts-114k](https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k) dataset. It improves upon the [Bespoke-Stratos-7B](https://huggingface.co/bespokelabs/Bespoke-Stratos-7B) model, which used 17k examples ([Bespoke-Stratos-17k](https://huggingface.co/datasets/bespokelabs/Bespoke-Stratos-17k)).**

*Special thanks to the folks at Open Thoughts for fine-tuning this version of Qwen/Qwen2.5-7B-Instruct. More information about it can be found here:*

[https://huggingface.co/open-thoughts/OpenThinker-7B](https://huggingface.co/open-thoughts/OpenThinker-7B) (Base Model)

[https://github.com/open-thoughts/open-thoughts](https://github.com/open-thoughts/open-thoughts) (Open Thoughts Git Repo)

I simply converted it to MLX format (using mlx-lm version **0.21.4**) with 4-bit quantization for better performance on Apple Silicon Macs (M1, M2, M3, and M4 chips).
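For reference, a conversion along these lines can be reproduced with the mlx-lm Python API. The sketch below is a minimal, illustrative example rather than the exact command used for this upload: the `convert` helper and its `hf_path`/`mlx_path`/`quantize`/`q_bits` parameters are assumed from the mlx-lm package, and the output directory name is hypothetical.

```python
# Minimal sketch of a 4-bit MLX conversion with mlx-lm (assumed API; not the
# exact command used for this repo). Requires `pip install mlx-lm` on an
# Apple Silicon Mac.
from mlx_lm import convert

convert(
    hf_path="open-thoughts/OpenThinker-7B",  # source model on the Hugging Face Hub
    mlx_path="OpenThinker-7B-4bit-mlx",      # local output directory (illustrative name)
    quantize=True,                           # quantize the weights during conversion
    q_bits=4,                                # 4 bits per weight, matching this upload
)
```

The same conversion can also be run from the shell via `python -m mlx_lm.convert`.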
# Alejandroolmedo/OpenThinker-7B-4bit-mlx
The Model [Alejandroolmedo/OpenThinker-7B-4bit-mlx](https://huggingface.co/Alejandroolmedo/OpenThinker-7B-4bit-mlx) was