Update README.md

README.md
results: []
---

# **About:**

**A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.**

**This model is a fine-tuned version of [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) on the [OpenThoughts-114k](https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k) dataset. It improves upon the [Bespoke-Stratos-7B](https://huggingface.co/bespokelabs/Bespoke-Stratos-7B) model, which used 17k examples ([Bespoke-Stratos-17k](https://huggingface.co/datasets/bespokelabs/Bespoke-Stratos-17k)).**

*Special thanks to the folks at Open Thoughts for fine-tuning this version of Qwen/Qwen2.5-7B-Instruct. More information about it can be found here:*

[https://huggingface.co/open-thoughts/OpenThinker-7B](https://huggingface.co/open-thoughts/OpenThinker-7B) (Base Model)

[https://github.com/open-thoughts/open-thoughts](https://github.com/open-thoughts/open-thoughts) (Open Thoughts Git Repo)

I simply converted it to MLX format (using mlx-lm version **0.21.4**) with 4-bit quantization for better performance on Apple Silicon Macs (M1, M2, M3, and M4 chips).
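For reference, a conversion along these lines can be reproduced with the mlx-lm Python API. The sketch below is a minimal, illustrative example rather than the exact command used for this upload: the `convert` helper and its `hf_path`/`mlx_path`/`quantize`/`q_bits` parameters are assumed from the mlx-lm package, and the output directory name is hypothetical.

```python
# Minimal sketch of a 4-bit MLX conversion with mlx-lm (assumed API; not the
# exact command used for this repo). Requires `pip install mlx-lm` on an
# Apple Silicon Mac.
from mlx_lm import convert

convert(
    hf_path="open-thoughts/OpenThinker-7B",  # source model on the Hugging Face Hub
    mlx_path="OpenThinker-7B-4bit-mlx",      # local output directory (illustrative name)
    quantize=True,                           # quantize the weights during conversion
    q_bits=4,                                # 4 bits per weight, matching this upload
)
```

The same conversion can also be run from the shell via `python -m mlx_lm.convert`.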
# Alejandroolmedo/OpenThinker-7B-4bit-mlx
The Model [Alejandroolmedo/OpenThinker-7B-4bit-mlx](https://huggingface.co/Alejandroolmedo/OpenThinker-7B-4bit-mlx) was