kyujinpy commited on
Commit
bb1898f
ยท
1 Parent(s): 93ab91a

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -6
README.md CHANGED
@@ -40,22 +40,22 @@ I use A100 GPU 40GB and COLAB, when trianing.
40
 
41
  | Model Name | Vocabulary Size | Description |
42
  | --- | --- | --- |
43
- | Original Platypus2 | NaN | Sentencepiece BPE |
44
- | **Expanded KO-Platypus-ex** | NaN | Sentencepiece BPE. Added Korean vocab and merges |
45
 
46
  **Tokenizing "์•ˆ๋…•ํ•˜์„ธ์š”, ์˜ค๋Š˜์€ ๋‚ ์”จ๊ฐ€ ์ข‹๋„ค์š”."**
47
 
48
  | Model | Tokens |
49
  | --- | --- |
50
- | Platypus2-7b | `[NaN]` |
51
- | KO-Platypus2-7b-ex | `[NaN]` |
52
 
53
  **Tokenizing "Platypus: Quick, Cheap, and Powerful Refinement of LLMs"**
54
 
55
  | Model | Tokens |
56
  | --- | --- |
57
- | Platypus2-7b | `[NaN]` |
58
- | KO-Platypus2-7b-ex | `[NaN]` |
59
 
60
  # **Model Benchmark**
61
 
 
40
 
41
  | Model Name | Vocabulary Size | Description |
42
  | --- | --- | --- |
43
+ | Original Platypus2 | 32000 | Sentencepiece BPE |
44
+ | **Expanded KO-Platypus-ex** | 46336 | Sentencepiece BPE. Added Korean vocab and merges |
45
 
46
  **Tokenizing "์•ˆ๋…•ํ•˜์„ธ์š”, ์˜ค๋Š˜์€ ๋‚ ์”จ๊ฐ€ ์ข‹๋„ค์š”."**
47
 
48
  | Model | Tokens |
49
  | --- | --- |
50
+ | Platypus2-7b | `['โ–', '์•ˆ', '<0xEB>', '<0x85>', '<0x95>', 'ํ•˜', '์„ธ', '์š”', ',', 'โ–', '์˜ค', '<0xEB>', '<0x8A>', '<0x98>', '์€', 'โ–', '<0xEB>', '<0x82>', '<0xA0>', '์”จ', '๊ฐ€', 'โ–', '<0xEC>', '<0xA2>', '<0x8B>', '<0xEB>', '<0x84>', '<0xA4>', '์š”', '.']` |
51
+ | KO-Platypus2-7b-ex | `['โ–์•ˆ๋…•', 'ํ•˜์„ธ์š”', ',', 'โ–์˜ค๋Š˜์€', 'โ–๋‚ ', '์”จ๊ฐ€', 'โ–์ข‹๋„ค์š”', '.']` |
52
 
53
  **Tokenizing "Platypus: Quick, Cheap, and Powerful Refinement of LLMs"**
54
 
55
  | Model | Tokens |
56
  | --- | --- |
57
+ | Platypus2-7b | `['โ–Plat', 'yp', 'us', ':', 'โ–Quick', ',', 'โ–Che', 'ap', ',', 'โ–and', 'โ–Power', 'ful', 'โ–Re', 'fin', 'ement', 'โ–of', 'โ–L', 'LM', 's']` |
58
+ | KO-Platypus2-7b-ex | `[โ–Plat', 'yp', 'us', ':', 'โ–Quick', ',', 'โ–Che', 'ap', ',', 'โ–and', 'โ–Power', 'ful', 'โ–Re', 'fin', 'ement', 'โ–of', 'โ–L', 'LM', 's']` |
59
 
60
  # **Model Benchmark**
61