OpenNLPLab commited on
Commit
60c48a0
Β·
verified Β·
1 Parent(s): b9d0948

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -11
README.md CHANGED
@@ -43,16 +43,21 @@ This official repository unveils the TransNormerLLM3 model along with its open-s
43
 
44
  # Released Weights
45
 
46
- | param | token | Hugging Face | Model Scope | Wisemodel |
47
- | :-----: | :---: | :--------------------------------------------------------------------------------------------------------------------: | :---------: | :-------: |
48
- | **15B** | 50B | πŸ€—[step13000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step13000-50Btokens) | πŸ€– | 🐯 |
49
- | **15B** | 100B | πŸ€—[step26000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step26000-100Btokens) | πŸ€– | 🐯 |
50
- | **15B** | 150B | πŸ€—[step39000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step39000-150Btokens) | πŸ€– | 🐯 |
51
- | **15B** | 200B | πŸ€—[step52000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step52000-200Btokens) | πŸ€– | 🐯 |
52
- | **15B** | 250B | πŸ€—[step65000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step65000-250Btokens) | πŸ€– | 🐯 |
53
- | **15B** | 300B | πŸ€—[step78000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step78000-300Btokens) | πŸ€– | 🐯 |
54
-
55
-
 
 
 
 
 
56
  ```python
57
  from transformers import AutoModelForCausalLM, AutoTokenizer
58
 
@@ -73,7 +78,11 @@ The evaluations of all models are conducted using the official settings and the
73
  | **TransNormerLLM3-15B** | 15 | 0.30 | 67.00 | 76.50 | 67.17 | 64.40 | 66.29 | 36.77 | 38.80 | 33.99 | 37.60 |
74
  | **TransNormerLLM3-15B** | 15 | 0.35 | 65.78 | 75.46 | 67.88 | 66.54 | 67.34 | 38.57 | 39.60 | 36.02 | 39.20 |
75
  | **TransNormerLLM3-15B** | 15 | 0.40 | 67.34 | 75.24 | 68.51 | 66.22 | 68.94 | 40.10 | 39.20 | 41.10 | 39.01 |
76
-
 
 
 
 
77
 
78
  > **P**: parameter size (billion). **T**: tokens (trillion). **BoolQ**: acc. **PIQA**: acc. **HellaSwag**: acc_norm. **WinoGrande**: acc. **ARC-easy**: acc. **ARC-challenge**: acc_norm. **OpenBookQA**: acc_norm. **MMLU**: 5-shot acc. **C-Eval**: 5-shot acc.
79
 
 
43
 
44
  # Released Weights
45
 
46
+ | param | token | Hugging Face | Model Scope | Wisemodel |
47
+ | :-----: | :---: | :----------------------------------------------------------------------------------------------------------------------: | :---------: | :-------: |
48
+ | **15B** | 50B | πŸ€—[step13000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step13000-50Btokens) | πŸ€– | 🐯 |
49
+ | **15B** | 100B | πŸ€—[step26000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step26000-100Btokens) | πŸ€– | 🐯 |
50
+ | **15B** | 150B | πŸ€—[step39000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step39000-150Btokens) | πŸ€– | 🐯 |
51
+ | **15B** | 200B | πŸ€—[step52000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step52000-200Btokens) | πŸ€– | 🐯 |
52
+ | **15B** | 250B | πŸ€—[step65000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step65000-250Btokens) | πŸ€– | 🐯 |
53
+ | **15B** | 300B | πŸ€—[step78000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step78000-300Btokens) | πŸ€– | 🐯 |
54
+ | **15B** | 350B | πŸ€—[step92000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step92000-350Btokens) | πŸ€– | 🐯 |
55
+ | **15B** | 400B | πŸ€—[step105000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step105000-400Btokens) | πŸ€– | 🐯 |
56
+ | **15B** | 450B | πŸ€—[step118000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step118000-450Btokens) | πŸ€– | 🐯 |
57
+ | **15B** | 500B | πŸ€—[step131000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step131000-500Btokens) | πŸ€– | 🐯 |
58
+ | **15B** | 550B | πŸ€—[step144000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step144000-550Btokens) | πŸ€– | 🐯 |
59
+ | **15B** | 600B | πŸ€—[step157000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step157000-600Btokens) | πŸ€– | 🐯 |
60
+ | **15B** | 650B | πŸ€—[step170000](https://huggingface.co/OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints/tree/step170000-650Btokens) | πŸ€– | 🐯 |
61
  ```python
62
  from transformers import AutoModelForCausalLM, AutoTokenizer
63
 
 
78
  | **TransNormerLLM3-15B** | 15 | 0.30 | 67.00 | 76.50 | 67.17 | 64.40 | 66.29 | 36.77 | 38.80 | 33.99 | 37.60 |
79
  | **TransNormerLLM3-15B** | 15 | 0.35 | 65.78 | 75.46 | 67.88 | 66.54 | 67.34 | 38.57 | 39.60 | 36.02 | 39.20 |
80
  | **TransNormerLLM3-15B** | 15 | 0.40 | 67.34 | 75.24 | 68.51 | 66.22 | 68.94 | 40.10 | 39.20 | 41.10 | 39.01 |
81
+ | **TransNormerLLM3-15B** | 15 | 0.45 | 69.02 | 76.28 | 69.11 | 63.77 | 65.82 | 36.01 | 39.40 | 37.17 | 42.80 |
82
+ | **TransNormerLLM3-15B** | 15 | 0.50 | 66.15 | 77.09 | 69.75 | 65.11 | 68.56 | 35.84 | 39.60 | 39.81 | 42.00 |
83
+ | **TransNormerLLM3-15B** | 15 | 0.55 | 70.24 | 74.05 | 69.96 | 65.75 | 65.61 | 36.69 | 38.60 | 40.08 | 44.00 |
84
+ | **TransNormerLLM3-15B** | 15 | 0.60 | 74.34 | 75.68 | 70.44 | 66.22 | 69.36 | 38.40 | 38.40 | 41.05 | 45.30 |
85
+ | **TransNormerLLM3-15B** | 15 | 0.65 | 73.15 | 76.55 | 71.60 | 66.46 | 69.65 | 39.68 | 40.80 | 41.20 | 44.90 |
86
 
87
  > **P**: parameter size (billion). **T**: tokens (trillion). **BoolQ**: acc. **PIQA**: acc. **HellaSwag**: acc_norm. **WinoGrande**: acc. **ARC-easy**: acc. **ARC-challenge**: acc_norm. **OpenBookQA**: acc_norm. **MMLU**: 5-shot acc. **C-Eval**: 5-shot acc.
88