Update README.md
Browse files
README.md
CHANGED
@@ -8,4 +8,28 @@ base_model:
|
|
8 |
- Qwen/Qwen2.5-Coder-0.5B-Instruct
|
9 |
tags:
|
10 |
- code
|
11 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
8 |
- Qwen/Qwen2.5-Coder-0.5B-Instruct
|
9 |
tags:
|
10 |
- code
|
11 |
+
---
|
12 |
+
|
13 |
+
Logseq use clojure script over datalog to interact with notes.
|
14 |
+
### Description of the approach
|
15 |
+
About 100 copies were collected manually and about 800 more were created on their basis using the Qwen2.5-Coder-7B-Instruct model. The test part of the dataset (about 100 synthetic copies) are run through the model with a system prompt describing the specifics of the queries and validated by the codestral-mamba model.
|
16 |
+
### Results
|
17 |
+
| model | overal | zero_shot | 1_shot | 3_shot | 5_shot |
|
18 |
+
|:-------------------------------|---------:|------------:|---------:|---------:|---------:|
|
19 |
+
| Qwen2.5-Coder-0.5B-lcq-2403-v1 | 0.2963 | 0.2963 | nan | nan | nan |
|
20 |
+
| Qwen2.5-Coder-7B-Instruct-AWQ | 0.0586 | 0.0247 | 0.0494 | 0.0988 | 0.0617 |
|
21 |
+
| gpt-4o | 0.0401 | 0.0123 | 0.0741 | 0.037 | 0.037 |
|
22 |
+
| gpt-4o-mini | 0.034 | 0.0123 | 0.0247 | 0.0617 | 0.037 |
|
23 |
+
| Qwen2.5-Coder-3B-Instruct | 0.0278 | 0 | 0.0123 | 0.0617 | 0.037 |
|
24 |
+
| Qwen2.5-Coder-1.5B-Instruct | 0.0123 | 0 | 0 | 0.0123 | 0.037 |
|
25 |
+
| Qwen2.5-Coder-0.5B-Instruct | 0.0031 | 0 | 0 | 0.0123 | 0 |
|
26 |
+
### How to use
|
27 |
+
I prefer to run model with sglang
|
28 |
+
```bash
|
29 |
+
python3.11 -m sglang.launch_server \
|
30 |
+
--model-path mandanya/Qwen2.5-Coder-0.5B-LCQ-v1 \
|
31 |
+
--port 23335 \
|
32 |
+
--host 0.0.0.0 \
|
33 |
+
--mem-fraction-static 0.5 \
|
34 |
+
--served-model-name "Qwen2.5-Coder-0.5B-LCQ-v1"
|
35 |
+
```
|