tokenizer used by submit model
Team Kuma
community
AI & ML interests
Large language Models
datasets
26
geniacllm/livedoor_news_corpus
Viewer
•
Updated
•
2.77k
•
45
•
1
geniacllm/wikipedia_v2
Preview
•
Updated
•
65
geniacllm/made_by_llm_and_human
Viewer
•
Updated
•
2.64k
•
46
geniacllm/hanrei
Viewer
•
Updated
•
2.9M
•
120
geniacllm/gsm8k
Viewer
•
Updated
•
1.03M
•
80
geniacllm/aozora_bunko
Viewer
•
Updated
•
10.2k
•
40
geniacllm/kokkai_v2
Preview
•
Updated
•
48
geniacllm/dataset_from_other_team
Viewer
•
Updated
•
27.1k
•
55
geniacllm/wiki40b
Viewer
•
Updated
•
1.2M
•
42
geniacllm/CulturaX_default_filtered_ja_10b
Preview
•
Updated
•
8