Spaces: xu-song/tokenizer-arena
988921c
tokenizer-arena/vocab/glm_chinese
2 contributors
History: 2 commits
Latest commit: xu-song, "add compress rate" (814ee6b), 7 months ago
Name                      Scan  Size       Last commit        Updated
chinese_sentencepiece     -     -          update             about 1 year ago
README.md                 Safe  487 Bytes  update             about 1 year ago
__init__.py               Safe  1.34 kB    add compress rate  7 months ago
convert_vocab_to_txt.py   Safe  689 Bytes  update             about 1 year ago
file_utils.py             Safe  8.38 kB    update             about 1 year ago
glm_chinese.vocab.txt     Safe  659 kB     update             about 1 year ago
sp_tokenizer.py           Safe  4.67 kB    update             about 1 year ago
test.py                   Safe  115 Bytes  add compress rate  7 months ago
test_glm.py               Safe  2.5 kB     update             about 1 year ago
tokenization.py           Safe  51.9 kB    update             about 1 year ago
tokenization_gpt2.py      Safe  13.5 kB    update             about 1 year ago
utils.py                  Safe  213 Bytes  update             about 1 year ago
wordpiece.py              Safe  15.5 kB    update             about 1 year ago