Update README.md
Browse files
README.md
CHANGED
@@ -67,7 +67,7 @@ The quantized model is more sensitive to data types and CUDA operations. To avoi
|
|
67 |
inputs.input_ids.to('cuda')
|
68 |
```
|
69 |
|
70 |
-
We have released checkpoints for these models. For pretraining, the naming convention is `stepXXX-tokensYYYB`. For checkpoints with ingredients of the soup, the naming convention is `stage2-ingredientN-stepXXX-tokensYYYB`
|
71 |
|
72 |
|
73 |
To load a specific model revision with HuggingFace, simply add the argument `revision`:
|
|
|
67 |
inputs.input_ids.to('cuda')
|
68 |
```
|
69 |
|
70 |
+
We have released checkpoints for these models. For pretraining, the naming convention is `stage1-stepXXX-tokensYYYB`. For checkpoints with ingredients of the soup, the naming convention is `stage2-ingredientN-stepXXX-tokensYYYB`
|
71 |
|
72 |
|
73 |
To load a specific model revision with HuggingFace, simply add the argument `revision`:
|