Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
lighttransport
/
japanese-tokenizer-cc100
like
2
Japanese
License:
mit
Model card
Files
Files and versions
Community
main
japanese-tokenizer-cc100
2 contributors
History:
4 commits
syoyo
Update README.md
b914e30
about 1 year ago
.gitattributes
1.52 kB
initial commit
about 1 year ago
README.md
598 Bytes
Update README.md
about 1 year ago
tokenizer-cc100-ja.json
1.2 MB
Initial.
about 1 year ago
train_jp_tokenizer.py
881 Bytes
Initial.
about 1 year ago