AlejandroOlmedo committed on
Commit
c876735
·
verified ·
1 Parent(s): 054df49

Update README.md

Files changed (1)
  1. README.md +14 -0
README.md CHANGED
@@ -14,6 +14,20 @@ model-index:
  results: []
  ---
 
+ # **About:**
+
+ **A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.**
+
+ **This model is a fine-tuned version of [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) on the [OpenThoughts-114k dataset](https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k). This model improves upon the [Bespoke-Stratos-7B model](https://huggingface.co/bespokelabs/Bespoke-Stratos-7B), which used 17k examples ([Bespoke-Stratos-17k dataset](https://huggingface.co/datasets/bespokelabs/Bespoke-Stratos-17k)).**
+
+ *Special thanks to the folks at Open Thoughts for fine-tuning this version of Qwen/Qwen2.5-7B-Instruct. More information about it can be found here:*
+
+ [https://huggingface.co/open-thoughts/OpenThinker-7B](https://huggingface.co/open-thoughts/OpenThinker-7B) (Base Model)
+
+ [**__https://github.com/open-thoughts/open-thoughts__**](https://github.com/open-thoughts/open-thoughts) (Open Thoughts Git Repo)
+
+ I simply converted it to MLX format (using mlx-lm version **0.21.4**) with 4-bit quantization for better performance on Apple Silicon Macs (M1, M2, M3, and M4 chips).
+
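To give a rough sense of why 4-bit quantization helps on memory-constrained Macs, the sketch below estimates the weight footprint before and after quantization. It is only a back-of-the-envelope calculation: the parameter count (about 7.6 billion), the group size of 64, and the 16-bit scale and bias stored per group are assumptions that are not stated in this README, and in practice some tensors may stay unquantized.

```python
def quantized_size_gb(n_params: float, bits: int = 4, group_size: int = 64,
                      scale_bits: int = 16, bias_bits: int = 16) -> float:
    """Rough size of group-quantized weights in GB (1 GB = 1e9 bytes)."""
    # Assumption: every group of `group_size` weights shares one scale and one bias.
    bits_per_weight = bits + (scale_bits + bias_bits) / group_size
    return n_params * bits_per_weight / 8 / 1e9


# Assumed approximate parameter count for a 7B-class model (hypothetical value).
N_PARAMS = 7.6e9

fp16_gb = N_PARAMS * 16 / 8 / 1e9      # unquantized 16-bit weights
q4_gb = quantized_size_gb(N_PARAMS)    # 4-bit weights plus per-group overhead

print(f"fp16: ~{fp16_gb:.1f} GB, 4-bit: ~{q4_gb:.1f} GB")
```

Under these assumptions the per-group overhead adds about half a bit per weight, so the quantized weights come out at a bit under a third of the 16-bit size.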
  # Alejandroolmedo/OpenThinker-7B-4bit-mlx
 
  The Model [Alejandroolmedo/OpenThinker-7B-4bit-mlx](https://huggingface.co/Alejandroolmedo/OpenThinker-7B-4bit-mlx) was