ProSST-2048 / vocab.txt
GinnM's picture
Upload tokenizer
c403337 verified
raw
history blame
70 Bytes
<pad>
<cls>
<eos>
A
C
D
E
F
G
H
I
K
L
M
N
P
Q
R
S
T
V
W
Y
<unk>
<mask>