This repository contains trained BPE tokenizers with various vocabulary sizes, used to study how vocabulary size affects the performance of language models.
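To give a feel for why vocabulary size matters, the following is a minimal sketch of byte-pair encoding in pure Python. It is not the training procedure used for the tokenizers in this repository. The toy corpus and the helper names (`train_bpe`, `merge_pair`, `encode`, `total_tokens`) are assumptions made for illustration. Each extra merge rule adds one entry to the vocabulary, so the number of merges stands in for vocabulary size here.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    out = []
    i = 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def train_bpe(corpus, num_merges):
    """Learn up to `num_merges` merge rules from a list of words."""
    # Every word starts as a tuple of single characters.
    words = Counter(tuple(w) for w in corpus)
    merges = []
    for _ in range(num_merges):
        # Count adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in words.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break  # every word is already a single symbol
        # Most frequent pair; ties go to the lexicographically largest pair.
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        new_words = Counter()
        for symbols, freq in words.items():
            new_words[merge_pair(symbols, best)] += freq
        words = new_words
    return merges


def encode(word, merges):
    """Tokenize one word by applying the learned merges in order."""
    symbols = tuple(word)
    for pair in merges:
        symbols = merge_pair(symbols, pair)
    return list(symbols)


# Hypothetical toy corpus, unrelated to the data behind this repository.
corpus = "low lower lowest new newer newest wide wider widest".split()


def total_tokens(num_merges):
    """Total tokens needed to encode the toy corpus with `num_merges` merges."""
    merges = train_bpe(corpus, num_merges)
    return sum(len(encode(w, merges)) for w in corpus)


for n in (0, 5, 20):
    print(n, "merges ->", total_tokens(n), "tokens")
```

With more merges, the same text is encoded in fewer tokens, at the cost of a larger vocabulary. That trade-off is the kind of effect tokenizers of different vocabulary sizes make it possible to measure.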


Collection including sail/scaling-with-vocab-trained-tokenizers