allenai/OLMo-2-0325-32B
amanrangapur committed 21 days ago

Commit ba41e77 · verified · 1 Parent(s): fd89b21

Update README.md

Files changed (1)

README.md +1 -1

@@ -67,7 +67,7 @@ The quantized model is more sensitive to data types and CUDA operations. To avoi
 inputs.input_ids.to('cuda')
 ```
-We have released checkpoints for these models. For pretraining, the naming convention is `stepXXX-tokensYYYB`. For checkpoints with ingredients of the soup, the naming convention is `stage2-ingredientN-stepXXX-tokensYYYB`
+We have released checkpoints for these models. For pretraining, the naming convention is `stage1-stepXXX-tokensYYYB`. For checkpoints with ingredients of the soup, the naming convention is `stage2-ingredientN-stepXXX-tokensYYYB`
 To load a specific model revision with HuggingFace, simply add the argument `revision`:
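
The renamed convention can be illustrated with a small sketch that builds and parses revision names. The helpers `build_revision` and `parse_revision` are hypothetical and not part of any released library. The sketch assumes that `XXX`, `YYY`, and `N` are plain integers, and the step and token counts used here are placeholders that may not correspond to published checkpoints.

```python
import re
from typing import Optional

# Hypothetical helpers illustrating the naming convention in this diff.
# Assumption: XXX (step), YYY (tokens, in billions) and N (ingredient) are integers.
_REVISION_RE = re.compile(
    r"^(?:stage1|stage2-ingredient(?P<ingredient>\d+))"
    r"-step(?P<step>\d+)-tokens(?P<tokens>\d+)B$"
)


def build_revision(step: int, tokens_b: int, ingredient: Optional[int] = None) -> str:
    """Return a revision name: stage1 for pretraining, stage2 for soup ingredients."""
    if ingredient is None:
        return f"stage1-step{step}-tokens{tokens_b}B"
    return f"stage2-ingredient{ingredient}-step{step}-tokens{tokens_b}B"


def parse_revision(name: str) -> dict:
    """Split a revision name into its parts, rejecting names without a stage prefix."""
    match = _REVISION_RE.match(name)
    if match is None:
        raise ValueError(f"not a recognized revision name: {name!r}")
    ingredient = match.group("ingredient")
    return {
        "stage": 1 if ingredient is None else 2,
        "ingredient": None if ingredient is None else int(ingredient),
        "step": int(match.group("step")),
        "tokens_b": int(match.group("tokens")),
    }


# The resulting string is what would be passed as `revision`, for example:
#   AutoModelForCausalLM.from_pretrained("allenai/OLMo-2-0325-32B",
#                                        revision=build_revision(1000, 9))
# (placeholder numbers; check the repository's branch list for real revisions)
```

Under these assumptions, a name in the old unprefixed form such as `step1000-tokens9B` is rejected by `parse_revision`, which matches the intent of this change: pretraining revisions now carry the `stage1-` prefix.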