fix formatting
README.md
CHANGED
@@ -11,7 +11,7 @@ Note that the vocabulary size of this model (50275) differs from the original `l
 `<|par|>, </|title|>, </|sec|>, <|sec-title|>, <|sent|>, <|title|>, <|abs|>, <|sec|>, </|sec-title|>, </|abs|>`.

 Transferring the checkpoint weights and saving the model was done based on [this code](https://github.com/dwadden/multivers/blob/a6ce033f0e17ae38c1f102eae1ee4ca213fbbe2e/multivers/model.py#L145) from the MultiVerS repository, the versions of `transformers==4.2.2` and `torch==1.7.1` correspond to the MultiVerS [requirements.txt](https://github.com/dwadden/multivers/blob/main/requirements.txt):
-```
+```python
 import os
 import pathlib
 import subprocess
@@ -57,7 +57,7 @@ model.save_pretrained(model_dir)
 ```

 The tokenizer was resized and saved following [this code](https://github.com/dwadden/multivers/blob/a6ce033f0e17ae38c1f102eae1ee4ca213fbbe2e/multivers/data.py#L14) from the MultiVerS repository:
-```
+```python
 from transformers import AutoTokenizer

 tokenizer = AutoTokenizer.from_pretrained("allenai/longformer-large-4096")