arcade100k / tokenization_arcade100k.py

Commit History

fix: add missing `errors`
b5b7b34

jon-tow commited on

feat: add pickle support
cd5298a

jon-tow commited on

fix: make `eos_token`/`pad_token` overridable
dfe1f6f

jon-tow commited on

feat: expose tiktoken tokenizer merge ranks and special tokens
ee2a4ae

jon-tow commited on

fix: check surface form against special tokens in `_add_tokens`
9ada655

jon-tow commited on

fix: update license
7e4138f

jon-tow commited on

fix: re-ordering special tokens
f221007

jon-tow commited on

fix: remove `print` debug statements
3c66e0d

jon-tow commited on

Update tokenization_arcade100k.py
8593ce9

jon-tow commited on

fix: create final list of special tokens
9b47601

jon-tow commited on

init: first commit
aceea5b

jon-tow commited on