CuckmeisterFuller commited on
Commit
b3ddb2a
·
verified ·
1 Parent(s): 9f375a5

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +48 -0
README.md ADDED
@@ -0,0 +1,48 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ - fr
5
+ - de
6
+ - es
7
+ - it
8
+ - pt
9
+ - zh
10
+ - ja
11
+ - ru
12
+ - ko
13
+ license: apache-2.0
14
+ library_name: vllm
15
+ base_model: mlx-community/Mistral-Small-24B-Instruct-2501-4bit
16
+ extra_gated_description: If you want to learn more about how we process your personal
17
+ data, please read our <a href="https://mistral.ai/terms/">Privacy Policy</a>.
18
+ tags:
19
+ - mlx
20
+ - mlx
21
+ - mlx-my-repo
22
+ ---
23
+
24
+ # CuckmeisterFuller/Mistral-Small-24B-Instruct-2501-4bit-Q2-mlx
25
+
26
+ The Model [CuckmeisterFuller/Mistral-Small-24B-Instruct-2501-4bit-Q2-mlx](https://huggingface.co/CuckmeisterFuller/Mistral-Small-24B-Instruct-2501-4bit-Q2-mlx) was converted to MLX format from [mlx-community/Mistral-Small-24B-Instruct-2501-4bit](https://huggingface.co/mlx-community/Mistral-Small-24B-Instruct-2501-4bit) using mlx-lm version **0.20.5**.
27
+
28
+ ## Use with mlx
29
+
30
+ ```bash
31
+ pip install mlx-lm
32
+ ```
33
+
34
+ ```python
35
+ from mlx_lm import load, generate
36
+
37
+ model, tokenizer = load("CuckmeisterFuller/Mistral-Small-24B-Instruct-2501-4bit-Q2-mlx")
38
+
39
+ prompt="hello"
40
+
41
+ if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
42
+ messages = [{"role": "user", "content": prompt}]
43
+ prompt = tokenizer.apply_chat_template(
44
+ messages, tokenize=False, add_generation_prompt=True
45
+ )
46
+
47
+ response = generate(model, tokenizer, prompt=prompt, verbose=True)
48
+ ```