Update README.md
README.md
CHANGED
@@ -8,6 +8,8 @@ datasets:
# TinyDeepSeek-JP-1.5B

This model is derived from [cyberagent/DeepSeek-R1-Distill-Qwen-14B-Japanese](https://huggingface.co/cyberagent/DeepSeek-R1-Distill-Qwen-14B-Japanese), a small distilled DeepSeek-R1 model with additional Japanese training,
and was reduced in size by applying TAID, a new distillation method proposed by Sakana AI.
@@ -22,30 +24,6 @@ This model is provided for research and development purposes only and should be
### Output Examples

-<details><summary>Give me a short introduction to large language model.</summary>
-
-```
-
-```
-
-</details>
-
-
-<details><summary>Tell me about large language models.</summary>
-
-```
-
-```
-
-</details>
-
-<details><summary>A regular hexagon can be divided into six equilateral triangles. If the perimeter of one of the triangles is 21 inches, what is the perimeter, in inches, of the regular hexagon?</summary>
-
-```
-
-```
-
-</details>
# TinyDeepSeek-JP-1.5B

+**Poor performance! Scrapped!**
+
This model is derived from [cyberagent/DeepSeek-R1-Distill-Qwen-14B-Japanese](https://huggingface.co/cyberagent/DeepSeek-R1-Distill-Qwen-14B-Japanese), a small distilled DeepSeek-R1 model with additional Japanese training,
and was reduced in size by applying TAID, a new distillation method proposed by Sakana AI.
### Output Examples