Update README.md
Browse files
README.md
CHANGED
@@ -105,7 +105,7 @@ We compare this to the original R1 model and test in both regimes where repetiti
|
|
105 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 1.0 | 60 | 94 |
|
106 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 1.1 | 62 | 96 |
|
107 |
| lightblue/DeepSeek-R1-Distill-Qwen-7B-Japanese | 1.0 | 66 | 92 |
|
108 |
-
| lightblue/DeepSeek-R1-Distill-Qwen-7B-Japanese | 1.1 | 70 | 98 |
|
109 |
|
110 |
Code for the SakanaAI/gsm8k-ja-test_250-1319 evaluation can be found [here](https://drive.google.com/file/d/1gCzCJv5vasw8R3KVQimfoIDFyfxwxNvC/view?usp=sharing).
|
111 |
|
@@ -118,7 +118,7 @@ This benchmark contains more varied and complex prompts, meaning this is a more
|
|
118 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 1.0 | 48 |
|
119 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 1.1 | 48 |
|
120 |
| lightblue/DeepSeek-R1-Distill-Qwen-7B-Japanese | 1.0 | 84 |
|
121 |
-
| lightblue/DeepSeek-R1-Distill-Qwen-7B-Japanese | 1.1 | 94 |
|
122 |
|
123 |
Code for the DeL-TaiseiOzaki/Tengentoppa-sft-reasoning-ja evaluation can be found [here](https://drive.google.com/file/d/1f75IM5x1SZrb300odkEsLMfKsfibrxvR/view?usp=sharing).
|
124 |
|
|
|
105 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 1.0 | 60 | 94 |
|
106 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 1.1 | 62 | 96 |
|
107 |
| lightblue/DeepSeek-R1-Distill-Qwen-7B-Japanese | 1.0 | 66 | 92 |
|
108 |
+
| lightblue/DeepSeek-R1-Distill-Qwen-7B-Japanese | 1.1 | **70** | **98** |
|
109 |
|
110 |
Code for the SakanaAI/gsm8k-ja-test_250-1319 evaluation can be found [here](https://drive.google.com/file/d/1gCzCJv5vasw8R3KVQimfoIDFyfxwxNvC/view?usp=sharing).
|
111 |
|
|
|
118 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 1.0 | 48 |
|
119 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 1.1 | 48 |
|
120 |
| lightblue/DeepSeek-R1-Distill-Qwen-7B-Japanese | 1.0 | 84 |
|
121 |
+
| lightblue/DeepSeek-R1-Distill-Qwen-7B-Japanese | 1.1 | **94** |
|
122 |
|
123 |
Code for the DeL-TaiseiOzaki/Tengentoppa-sft-reasoning-ja evaluation can be found [here](https://drive.google.com/file/d/1f75IM5x1SZrb300odkEsLMfKsfibrxvR/view?usp=sharing).
|
124 |
|