mandanya
/

Qwen2.5-Coder-0.5B-LCQ-v1

Model card Files Files and versions Community

mandanya commited on Nov 28, 2024

Commit

6cc5bb7

·

verified ·

1 Parent(s): b9957b8

Update README.md

Files changed (1) hide show

README.md +25 -1

README.md CHANGED Viewed

@@ -8,4 +8,28 @@ base_model:
 - Qwen/Qwen2.5-Coder-0.5B-Instruct
 tags:
 - code
----

 - Qwen/Qwen2.5-Coder-0.5B-Instruct
 tags:
 - code
+---
+Logseq use clojure script over datalog to interact with notes.
+### Description of the approach
+About 100 copies were collected manually and about 800 more were created on their basis using the Qwen2.5-Coder-7B-Instruct model. The test part of the dataset (about 100 synthetic copies) are run through the model with a system prompt describing the specifics of the queries and validated by the codestral-mamba model.
+### Results
+| model | overal | zero_shot | 1_shot | 3_shot | 5_shot |
+|:-------------------------------|---------:|------------:|---------:|---------:|---------:|
+| Qwen2.5-Coder-0.5B-lcq-2403-v1 | 0.2963 | 0.2963 | nan | nan | nan |
+| Qwen2.5-Coder-7B-Instruct-AWQ | 0.0586 | 0.0247 | 0.0494 | 0.0988 | 0.0617 |
+| gpt-4o | 0.0401 | 0.0123 | 0.0741 | 0.037 | 0.037 |
+| gpt-4o-mini | 0.034 | 0.0123 | 0.0247 | 0.0617 | 0.037 |
+| Qwen2.5-Coder-3B-Instruct | 0.0278 | 0 | 0.0123 | 0.0617 | 0.037 |
+| Qwen2.5-Coder-1.5B-Instruct | 0.0123 | 0 | 0 | 0.0123 | 0.037 |
+| Qwen2.5-Coder-0.5B-Instruct | 0.0031 | 0 | 0 | 0.0123 | 0 |
+### How to use
+I prefer to run model with sglang
+```bash
+python3.11 -m sglang.launch_server \
+--model-path mandanya/Qwen2.5-Coder-0.5B-LCQ-v1 \
+--port 23335 \
+--host 0.0.0.0 \
+--mem-fraction-static 0.5 \
+--served-model-name "Qwen2.5-Coder-0.5B-LCQ-v1"
+```