3v324v23 committed on
Commit
0034259
2 Parent(s): c15e37e a9eeb59

Merge branch 'main' of https://huggingface.co/nakamura196/second-model

Files changed (2)
  1. README.md +27 -0
  2. config.json +1 -1
README.md ADDED
@@ -0,0 +1,27 @@
+ ---
+ language:
+ - "ja"
+ tags:
+ - "japanese"
+ - "masked-lm"
+ license: "cc-by-sa-4.0"
+ pipeline_tag: "fill-mask"
+ mask_token: "[MASK]"
+ widget:
+ - text: "入[MASK]外無之候江戸大水又ハ大地震なと"
+ - text: "日向[MASK]御望之由可令披露候"
+ ---
+
+ # roberta-small-hi-char
+
+ ## Model Description
+
+ This is a RoBERTa model pre-trained on HI texts with character tokenizer.
+
+ ## How to Use
+
+ ```py
+ from transformers import AutoTokenizer,AutoModelForMaskedLM
+ tokenizer=AutoTokenizer.from_pretrained("nakamura196/roberta-small-hi-char")
+ model=AutoModelForMaskedLM.from_pretrained("nakamura196/roberta-small-hi-char")
+ ```
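
Note on the README's "How to Use" section: it stops after loading the tokenizer and model. The following is a minimal sketch of how a fill-mask query might look, assuming the `transformers` package with a backend such as PyTorch is installed and the model can be downloaded from the Hub. The helpers `mask_char` and `predict_masked` are hypothetical names introduced here for illustration; they are not part of the repository.

```python
from typing import List, Tuple

# Repository name taken from the README in this commit.
MODEL_ID = "nakamura196/roberta-small-hi-char"


def mask_char(text: str, index: int, mask_token: str = "[MASK]") -> str:
    """Replace the single character at `index` with the mask token.

    The model uses a character tokenizer, so masking one character
    corresponds to masking one token (an assumption based on the README).
    """
    if not 0 <= index < len(text):
        raise IndexError(f"index {index} is outside the text (length {len(text)})")
    return text[:index] + mask_token + text[index + 1:]


def predict_masked(text: str, top_k: int = 5) -> List[Tuple[str, float]]:
    """Return (token, score) pairs for the single [MASK] in `text`.

    Downloads the model on first use; requires network access.
    """
    # Imported lazily so that mask_char stays usable without transformers.
    from transformers import pipeline

    fill_mask = pipeline("fill-mask", model=MODEL_ID)
    results = fill_mask(text, top_k=top_k)
    return [(r["token_str"], r["score"]) for r in results]
```

For example, `predict_masked("日向[MASK]御望之由可令披露候")` would query the model with one of the widget texts from the README metadata; the returned candidates depend on the published weights.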
config.json CHANGED
@@ -1,5 +1,5 @@
  {
- "_name_or_path": "roberta-small-japanese-aozora-char-custom",
+ "_name_or_path": "roberta-small-hi-char",
  "architectures": [
  "RobertaForMaskedLM"
  ],