Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
yhavinga
/
dutch-tokenizer-arena
Running

App Files Files Community
1
Fetching metadata from the HF Docker repository...
dutch-tokenizer-arena / utils
Ctrl+K
Ctrl+K
  • 3 contributors
History: 20 commits
yhavinga's picture
yhavinga
Add Llama tokenizer creation for Dutch, English, Code, Markdown and TeX.
c78da21 about 1 year ago
  • byte_util.py
    0 Bytes
    update over 1 year ago
  • character_util.py
    6.92 kB
    add compression leaderboard about 1 year ago
  • compression_util.py
    7.26 kB
    Add Llama tokenizer creation for Dutch, English, Code, Markdown and TeX. about 1 year ago
  • convert_sp_to_json.py
    54 Bytes
    update over 1 year ago
  • fn_util.py
    0 Bytes
    add more tokenizers over 1 year ago
  • lang_util.py
    3.45 kB
    add compression leaderboard about 1 year ago
  • lang_util_2.py
    3.05 kB
    update about 1 year ago
  • log_util.py
    285 Bytes
    update over 1 year ago
  • oov_util.py
    265 Bytes
    update over 1 year ago
  • speed_util.py
    77 Bytes
    update over 1 year ago
  • symbol.py
    1.28 kB
    update over 1 year ago
  • text_util.py
    671 Bytes
    add compression leaderboard about 1 year ago
  • vocab.jd.txt.v2
    47.7 kB
    update over 1 year ago