EC2 Default User commited on
Commit
f471992
1 Parent(s): dde4984

Update spaCy pipeline

Browse files
README.md CHANGED
@@ -14,47 +14,41 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.7358998362
18
  - name: NER Recall
19
  type: recall
20
- value: 0.6910989011
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.7127961011
24
  - task:
25
- name: POS
26
  type: token-classification
27
  metrics:
28
- - name: POS Accuracy
29
  type: accuracy
30
- value: 0.9037457747
31
  - task:
32
- name: SENTER
33
  type: token-classification
34
  metrics:
35
- - name: SENTER Precision
36
- type: precision
37
- value: 0.7896445968
38
- - name: SENTER Recall
39
- type: recall
40
- value: 0.7286499084
41
- - name: SENTER F Score
42
  type: f_score
43
- value: 0.7579220779
44
  - task:
45
- name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
- - name: Unlabeled Dependencies Accuracy
49
- type: accuracy
50
- value: 0.7069146954
51
  - task:
52
- name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
- - name: Labeled Dependencies Accuracy
56
- type: accuracy
57
- value: 0.7069146954
58
  ---
59
  ### Details: https://spacy.io/models/zh#zh_core_web_lg
60
 
@@ -63,8 +57,8 @@ Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `zh_core_web_lg` |
66
- | **Version** | `3.2.0` |
67
- | **spaCy** | `>=3.2.0,<3.3.0` |
68
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `ner` |
69
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `ner` |
70
  | **Vectors** | 500000 keys, 500000 unique vectors (300 dimensions) |
@@ -76,13 +70,12 @@ Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
76
 
77
  <details>
78
 
79
- <summary>View label scheme (101 labels for 4 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
  | **`tagger`** | `AD`, `AS`, `BA`, `CC`, `CD`, `CS`, `DEC`, `DEG`, `DER`, `DEV`, `DT`, `ETC`, `FW`, `IJ`, `INF`, `JJ`, `LB`, `LC`, `M`, `MSP`, `NN`, `NR`, `NT`, `OD`, `ON`, `P`, `PN`, `PU`, `SB`, `SP`, `URL`, `VA`, `VC`, `VE`, `VV`, `X` |
84
  | **`parser`** | `ROOT`, `acl`, `advcl:loc`, `advmod`, `advmod:dvp`, `advmod:loc`, `advmod:rcomp`, `amod`, `amod:ordmod`, `appos`, `aux:asp`, `aux:ba`, `aux:modal`, `aux:prtmod`, `auxpass`, `case`, `cc`, `ccomp`, `compound:nn`, `compound:vc`, `conj`, `cop`, `dep`, `det`, `discourse`, `dobj`, `etc`, `mark`, `mark:clf`, `name`, `neg`, `nmod`, `nmod:assmod`, `nmod:poss`, `nmod:prep`, `nmod:range`, `nmod:tmod`, `nmod:topic`, `nsubj`, `nsubj:xsubj`, `nsubjpass`, `nummod`, `parataxis:prnmod`, `punct`, `xcomp` |
85
- | **`senter`** | `I`, `S` |
86
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
87
 
88
  </details>
@@ -95,12 +88,12 @@ Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
95
  | `TOKEN_P` | 94.58 |
96
  | `TOKEN_R` | 91.36 |
97
  | `TOKEN_F` | 92.94 |
98
- | `TAG_ACC` | 90.37 |
99
- | `SENTS_P` | 78.96 |
100
- | `SENTS_R` | 72.86 |
101
- | `SENTS_F` | 75.79 |
102
- | `DEP_UAS` | 70.69 |
103
- | `DEP_LAS` | 65.55 |
104
- | `ENTS_P` | 73.59 |
105
- | `ENTS_R` | 69.11 |
106
- | `ENTS_F` | 71.28 |
 
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.7403037383
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.6963736264
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.7176670442
24
  - task:
25
+ name: TAG
26
  type: token-classification
27
  metrics:
28
+ - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
+ value: 0.903399232
31
  - task:
32
+ name: UNLABELED_DEPENDENCIES
33
  type: token-classification
34
  metrics:
35
+ - name: Unlabeled Attachment Score (UAS)
 
 
 
 
 
 
36
  type: f_score
37
+ value: 0.708630098
38
  - task:
39
+ name: LABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
+ - name: Labeled Attachment Score (LAS)
43
+ type: f_score
44
+ value: 0.6570108094
45
  - task:
46
+ name: SENTS
47
  type: token-classification
48
  metrics:
49
+ - name: Sentences F-Score
50
+ type: f_score
51
+ value: 0.757283227
52
  ---
53
  ### Details: https://spacy.io/models/zh#zh_core_web_lg
54
 
 
57
  | Feature | Description |
58
  | --- | --- |
59
  | **Name** | `zh_core_web_lg` |
60
+ | **Version** | `3.3.0` |
61
+ | **spaCy** | `>=3.3.0.dev0,<3.4.0` |
62
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `ner` |
63
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `ner` |
64
  | **Vectors** | 500000 keys, 500000 unique vectors (300 dimensions) |
 
70
 
71
  <details>
72
 
73
+ <summary>View label scheme (99 labels for 3 components)</summary>
74
 
75
  | Component | Labels |
76
  | --- | --- |
77
  | **`tagger`** | `AD`, `AS`, `BA`, `CC`, `CD`, `CS`, `DEC`, `DEG`, `DER`, `DEV`, `DT`, `ETC`, `FW`, `IJ`, `INF`, `JJ`, `LB`, `LC`, `M`, `MSP`, `NN`, `NR`, `NT`, `OD`, `ON`, `P`, `PN`, `PU`, `SB`, `SP`, `URL`, `VA`, `VC`, `VE`, `VV`, `X` |
78
  | **`parser`** | `ROOT`, `acl`, `advcl:loc`, `advmod`, `advmod:dvp`, `advmod:loc`, `advmod:rcomp`, `amod`, `amod:ordmod`, `appos`, `aux:asp`, `aux:ba`, `aux:modal`, `aux:prtmod`, `auxpass`, `case`, `cc`, `ccomp`, `compound:nn`, `compound:vc`, `conj`, `cop`, `dep`, `det`, `discourse`, `dobj`, `etc`, `mark`, `mark:clf`, `name`, `neg`, `nmod`, `nmod:assmod`, `nmod:poss`, `nmod:prep`, `nmod:range`, `nmod:tmod`, `nmod:topic`, `nsubj`, `nsubj:xsubj`, `nsubjpass`, `nummod`, `parataxis:prnmod`, `punct`, `xcomp` |
 
79
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
 
81
  </details>
 
88
  | `TOKEN_P` | 94.58 |
89
  | `TOKEN_R` | 91.36 |
90
  | `TOKEN_F` | 92.94 |
91
+ | `TAG_ACC` | 90.34 |
92
+ | `SENTS_P` | 78.52 |
93
+ | `SENTS_R` | 73.13 |
94
+ | `SENTS_F` | 75.73 |
95
+ | `DEP_UAS` | 70.86 |
96
+ | `DEP_LAS` | 65.70 |
97
+ | `ENTS_P` | 74.03 |
98
+ | `ENTS_R` | 69.64 |
99
+ | `ENTS_F` | 71.77 |
accuracy.json CHANGED
@@ -3,207 +3,207 @@
3
  "token_p": 0.9458325855,
4
  "token_r": 0.9136060443,
5
  "token_f": 0.9294400505,
6
- "tag_acc": 0.9037457747,
7
- "sents_p": 0.7896445968,
8
- "sents_r": 0.7286499084,
9
- "sents_f": 0.7579220779,
10
- "dep_uas": 0.7069146954,
11
- "dep_las": 0.6555390607,
12
  "dep_las_per_type": {
13
  "dep": {
14
- "p": 0.4876810512,
15
- "r": 0.3299989896,
16
- "f": 0.3936362541
17
  },
18
  "case": {
19
- "p": 0.8168795974,
20
- "r": 0.7674587779,
21
- "f": 0.7913983872
22
  },
23
  "nmod:tmod": {
24
- "p": 0.7313237221,
25
- "r": 0.7591836735,
26
- "f": 0.7449933244
27
  },
28
  "nummod": {
29
- "p": 0.8191268191,
30
- "r": 0.5249833444,
31
- "f": 0.6398700771
32
  },
33
  "mark:clf": {
34
- "p": 0.9383017715,
35
- "r": 0.572920552,
36
- "f": 0.7114404817
37
  },
38
  "auxpass": {
39
- "p": 0.8817204301,
40
- "r": 0.8864864865,
41
- "f": 0.884097035
42
  },
43
  "nsubj": {
44
- "p": 0.7777050039,
45
- "r": 0.7292715883,
46
- "f": 0.7527099842
47
  },
48
  "acl": {
49
- "p": 0.7153127247,
50
- "r": 0.5518580144,
51
- "f": 0.623043206
52
  },
53
  "advmod": {
54
- "p": 0.8195641156,
55
- "r": 0.7331670823,
56
- "f": 0.7739619481
57
  },
58
  "mark": {
59
- "p": 0.7456996746,
60
- "r": 0.7028921998,
61
- "f": 0.7236634333
62
  },
63
  "xcomp": {
64
- "p": 0.7944444444,
65
- "r": 0.6986970684,
66
- "f": 0.7435008666
67
  },
68
  "nmod:assmod": {
69
- "p": 0.7745130406,
70
- "r": 0.7301587302,
71
- "f": 0.7516821532
72
  },
73
  "det": {
74
- "p": 0.8369132856,
75
- "r": 0.6162858817,
76
- "f": 0.709851552
77
  },
78
  "amod": {
79
- "p": 0.7794589638,
80
- "r": 0.6677140613,
81
- "f": 0.7192722657
82
  },
83
  "nmod:prep": {
84
- "p": 0.7016613644,
85
- "r": 0.6004234725,
86
- "f": 0.6471067645
87
  },
88
  "root": {
89
- "p": 0.7394862036,
90
- "r": 0.6469119361,
91
- "f": 0.6901083289
92
  },
93
  "aux:prtmod": {
94
- "p": 0.9246031746,
95
- "r": 0.8321428571,
96
- "f": 0.8759398496
97
  },
98
  "compound:nn": {
99
- "p": 0.7463895738,
100
- "r": 0.7170896785,
101
- "f": 0.7314463238
102
  },
103
  "dobj": {
104
- "p": 0.7939269334,
105
- "r": 0.7435935417,
106
- "f": 0.7679363622
107
  },
108
  "ccomp": {
109
- "p": 0.6330907698,
110
- "r": 0.6426905132,
111
- "f": 0.6378545244
112
  },
113
  "advmod:rcomp": {
114
- "p": 0.8229813665,
115
- "r": 0.7340720222,
116
- "f": 0.775988287
117
  },
118
  "nmod:topic": {
119
- "p": 0.3762886598,
120
- "r": 0.237012987,
121
- "f": 0.2908366534
122
  },
123
  "cop": {
124
- "p": 0.7518367347,
125
- "r": 0.5926640927,
126
- "f": 0.6628283555
127
  },
128
  "discourse": {
129
- "p": 0.5575139147,
130
- "r": 0.4958745875,
131
- "f": 0.5248908297
132
  },
133
  "neg": {
134
- "p": 0.8395802099,
135
- "r": 0.6658739596,
136
- "f": 0.7427055703
137
  },
138
  "aux:modal": {
139
- "p": 0.8475289169,
140
- "r": 0.8335056877,
141
- "f": 0.8404588113
142
  },
143
  "nmod": {
144
- "p": 0.7278688525,
145
- "r": 0.6024423338,
146
- "f": 0.6592427617
147
  },
148
  "aux:ba": {
149
- "p": 0.807486631,
150
- "r": 0.8031914894,
151
- "f": 0.8053333333
152
  },
153
  "advmod:loc": {
154
- "p": 0.6349206349,
155
- "r": 0.4747774481,
156
- "f": 0.5432937182
157
  },
158
  "aux:asp": {
159
- "p": 0.9013854931,
160
- "r": 0.8819776715,
161
- "f": 0.8915759774
162
  },
163
  "conj": {
164
- "p": 0.4869204402,
165
- "r": 0.5102079395,
166
- "f": 0.4982922551
167
  },
168
  "nsubjpass": {
169
- "p": 0.8048780488,
170
- "r": 0.66,
171
- "f": 0.7252747253
172
  },
173
  "compound:vc": {
174
- "p": 0.4647058824,
175
- "r": 0.4093264249,
176
- "f": 0.435261708
177
  },
178
  "advcl:loc": {
179
- "p": 0.5573770492,
180
- "r": 0.4857142857,
181
- "f": 0.5190839695
182
  },
183
  "cc": {
184
- "p": 0.7340425532,
185
- "r": 0.6734693878,
186
- "f": 0.7024525683
187
  },
188
  "advmod:dvp": {
189
- "p": 0.8320610687,
190
- "r": 0.6770186335,
191
- "f": 0.7465753425
192
  },
193
  "appos": {
194
- "p": 0.8740920097,
195
- "r": 0.8298850575,
196
- "f": 0.8514150943
197
  },
198
  "nmod:poss": {
199
- "p": 0.7341772152,
200
- "r": 0.4296296296,
201
- "f": 0.5420560748
202
  },
203
  "name": {
204
- "p": 0.6018518519,
205
- "r": 0.4814814815,
206
- "f": 0.5349794239
207
  },
208
  "nsubj:xsubj": {
209
  "p": 0.0,
@@ -211,19 +211,19 @@
211
  "f": 0.0
212
  },
213
  "nmod:range": {
214
- "p": 0.7035714286,
215
- "r": 0.6610738255,
216
- "f": 0.6816608997
217
  },
218
  "parataxis:prnmod": {
219
- "p": 0.5454545455,
220
- "r": 0.1353383459,
221
- "f": 0.2168674699
222
  },
223
  "amod:ordmod": {
224
- "p": 0.564516129,
225
- "r": 0.546875,
226
- "f": 0.5555555556
227
  },
228
  "erased": {
229
  "p": 0.0,
@@ -231,89 +231,89 @@
231
  "f": 0.0
232
  },
233
  "etc": {
234
- "p": 0.9069767442,
235
- "r": 0.9285714286,
236
- "f": 0.9176470588
237
  }
238
  },
239
- "ents_p": 0.7358998362,
240
- "ents_r": 0.6910989011,
241
- "ents_f": 0.7127961011,
242
  "ents_per_type": {
243
  "DATE": {
244
- "p": 0.7675925926,
245
- "r": 0.82160555,
246
- "f": 0.7936811872
247
  },
248
  "GPE": {
249
- "p": 0.7719060524,
250
- "r": 0.8352883675,
251
- "f": 0.8023474178
252
  },
253
  "ORDINAL": {
254
- "p": 0.8388888889,
255
- "r": 0.7947368421,
256
- "f": 0.8162162162
257
  },
258
  "FAC": {
259
- "p": 0.5581395349,
260
  "r": 0.3870967742,
261
- "f": 0.4571428571
262
  },
263
  "ORG": {
264
- "p": 0.7028571429,
265
- "r": 0.6552511416,
266
- "f": 0.6782197716
267
  },
268
  "LOC": {
269
- "p": 0.5894039735,
270
- "r": 0.4784946237,
271
- "f": 0.528189911
272
  },
273
  "QUANTITY": {
274
- "p": 0.7889908257,
275
- "r": 0.637037037,
276
- "f": 0.7049180328
277
  },
278
- "WORK_OF_ART": {
279
- "p": 0.5,
280
- "r": 0.2866666667,
281
- "f": 0.3644067797
282
  },
283
  "CARDINAL": {
284
- "p": 0.614744352,
285
- "r": 0.5211693548,
286
- "f": 0.5641025641
287
  },
288
  "NORP": {
289
- "p": 0.6755952381,
290
- "r": 0.4768907563,
291
- "f": 0.5591133005
292
  },
293
  "TIME": {
294
- "p": 0.7365853659,
295
- "r": 0.7330097087,
296
- "f": 0.7347931873
 
 
 
 
 
297
  },
298
  "MONEY": {
299
- "p": 0.9322033898,
300
- "r": 0.8148148148,
301
- "f": 0.8695652174
302
  },
303
  "EVENT": {
304
- "p": 0.5681818182,
305
- "r": 0.3676470588,
306
- "f": 0.4464285714
307
- },
308
- "PERSON": {
309
- "p": 0.8077682686,
310
- "r": 0.7905927835,
311
- "f": 0.7990882449
312
  },
313
  "PERCENT": {
314
- "p": 0.7882352941,
315
- "r": 0.8072289157,
316
- "f": 0.7976190476
317
  },
318
  "PRODUCT": {
319
  "p": 0.0,
@@ -321,15 +321,15 @@
321
  "f": 0.0
322
  },
323
  "LAW": {
324
- "p": 0.3333333333,
325
- "r": 0.1,
326
- "f": 0.1538461538
327
  },
328
  "LANGUAGE": {
329
- "p": 0.5555555556,
330
- "r": 0.5555555556,
331
- "f": 0.5555555556
332
  }
333
  },
334
- "speed": 7127.6040150529
335
  }
 
3
  "token_p": 0.9458325855,
4
  "token_r": 0.9136060443,
5
  "token_f": 0.9294400505,
6
+ "tag_acc": 0.903399232,
7
+ "sents_p": 0.7851653262,
8
+ "sents_r": 0.7313134676,
9
+ "sents_f": 0.757283227,
10
+ "dep_uas": 0.708630098,
11
+ "dep_las": 0.6570108094,
12
  "dep_las_per_type": {
13
  "dep": {
14
+ "p": 0.4873308379,
15
+ "r": 0.3420228352,
16
+ "f": 0.4019473965
17
  },
18
  "case": {
19
+ "p": 0.8121243126,
20
+ "r": 0.7698836081,
21
+ "f": 0.7904400324
22
  },
23
  "nmod:tmod": {
24
+ "p": 0.7419786096,
25
+ "r": 0.7551020408,
26
+ "f": 0.7484828051
27
  },
28
  "nummod": {
29
+ "p": 0.8179043744,
30
+ "r": 0.5356429047,
31
+ "f": 0.6473429952
32
  },
33
  "mark:clf": {
34
+ "p": 0.9362745098,
35
+ "r": 0.5699365908,
36
+ "f": 0.7085555298
37
  },
38
  "auxpass": {
39
+ "p": 0.8617021277,
40
+ "r": 0.8756756757,
41
+ "f": 0.8686327078
42
  },
43
  "nsubj": {
44
+ "p": 0.7863859092,
45
+ "r": 0.7293944233,
46
+ "f": 0.7568187612
47
  },
48
  "acl": {
49
+ "p": 0.6861842105,
50
+ "r": 0.5784803106,
51
+ "f": 0.6277460126
52
  },
53
  "advmod": {
54
+ "p": 0.8230973788,
55
+ "r": 0.7367943777,
56
+ "f": 0.7775584664
57
  },
58
  "mark": {
59
+ "p": 0.7435536803,
60
+ "r": 0.6950043821,
61
+ "f": 0.7184597961
62
  },
63
  "xcomp": {
64
+ "p": 0.7836363636,
65
+ "r": 0.7019543974,
66
+ "f": 0.7405498282
67
  },
68
  "nmod:assmod": {
69
+ "p": 0.763022508,
70
+ "r": 0.7385620915,
71
+ "f": 0.7505930729
72
  },
73
  "det": {
74
+ "p": 0.8353317346,
75
+ "r": 0.6121851201,
76
+ "f": 0.7065584855
77
  },
78
  "amod": {
79
+ "p": 0.7771274201,
80
+ "r": 0.6779261587,
81
+ "f": 0.7241451647
82
  },
83
  "nmod:prep": {
84
+ "p": 0.6958174905,
85
+ "r": 0.608892922,
86
+ "f": 0.6494595903
87
  },
88
  "root": {
89
+ "p": 0.74281935,
90
+ "r": 0.6544031963,
91
+ "f": 0.6958137888
92
  },
93
  "aux:prtmod": {
94
+ "p": 0.8976377953,
95
+ "r": 0.8142857143,
96
+ "f": 0.8539325843
97
  },
98
  "compound:nn": {
99
+ "p": 0.7375549692,
100
+ "r": 0.7094754653,
101
+ "f": 0.7232427771
102
  },
103
  "dobj": {
104
+ "p": 0.8049725541,
105
+ "r": 0.7385572508,
106
+ "f": 0.7703360371
107
  },
108
  "ccomp": {
109
+ "p": 0.65,
110
+ "r": 0.6318040435,
111
+ "f": 0.6407728707
112
  },
113
  "advmod:rcomp": {
114
+ "p": 0.8198757764,
115
+ "r": 0.7313019391,
116
+ "f": 0.7730600293
117
  },
118
  "nmod:topic": {
119
+ "p": 0.3668122271,
120
+ "r": 0.2727272727,
121
+ "f": 0.312849162
122
  },
123
  "cop": {
124
+ "p": 0.7520325203,
125
+ "r": 0.5952380952,
126
+ "f": 0.6645114943
127
  },
128
  "discourse": {
129
+ "p": 0.572761194,
130
+ "r": 0.5066006601,
131
+ "f": 0.5376532399
132
  },
133
  "neg": {
134
+ "p": 0.8438438438,
135
+ "r": 0.6682520809,
136
+ "f": 0.7458526875
137
  },
138
  "aux:modal": {
139
+ "p": 0.862911796,
140
+ "r": 0.8397104447,
141
+ "f": 0.8511530398
142
  },
143
  "nmod": {
144
+ "p": 0.7196850394,
145
+ "r": 0.6200814111,
146
+ "f": 0.666180758
147
  },
148
  "aux:ba": {
149
+ "p": 0.8202247191,
150
+ "r": 0.7765957447,
151
+ "f": 0.7978142077
152
  },
153
  "advmod:loc": {
154
+ "p": 0.6396761134,
155
+ "r": 0.46884273,
156
+ "f": 0.5410958904
157
  },
158
  "aux:asp": {
159
+ "p": 0.9109816972,
160
+ "r": 0.8732057416,
161
+ "f": 0.8916938111
162
  },
163
  "conj": {
164
+ "p": 0.5052874447,
165
+ "r": 0.4967863894,
166
+ "f": 0.5010008579
167
  },
168
  "nsubjpass": {
169
+ "p": 0.8205128205,
170
+ "r": 0.64,
171
+ "f": 0.7191011236
172
  },
173
  "compound:vc": {
174
+ "p": 0.4213483146,
175
+ "r": 0.3886010363,
176
+ "f": 0.4043126685
177
  },
178
  "advcl:loc": {
179
+ "p": 0.5267175573,
180
+ "r": 0.4928571429,
181
+ "f": 0.5092250923
182
  },
183
  "cc": {
184
+ "p": 0.7130600572,
185
+ "r": 0.6637089618,
186
+ "f": 0.6875
187
  },
188
  "advmod:dvp": {
189
+ "p": 0.8536585366,
190
+ "r": 0.652173913,
191
+ "f": 0.7394366197
192
  },
193
  "appos": {
194
+ "p": 0.8705035971,
195
+ "r": 0.8344827586,
196
+ "f": 0.8521126761
197
  },
198
  "nmod:poss": {
199
+ "p": 0.6947368421,
200
+ "r": 0.4888888889,
201
+ "f": 0.5739130435
202
  },
203
  "name": {
204
+ "p": 0.71,
205
+ "r": 0.5259259259,
206
+ "f": 0.6042553191
207
  },
208
  "nsubj:xsubj": {
209
  "p": 0.0,
 
211
  "f": 0.0
212
  },
213
  "nmod:range": {
214
+ "p": 0.7354085603,
215
+ "r": 0.6342281879,
216
+ "f": 0.6810810811
217
  },
218
  "parataxis:prnmod": {
219
+ "p": 0.4193548387,
220
+ "r": 0.0977443609,
221
+ "f": 0.1585365854
222
  },
223
  "amod:ordmod": {
224
+ "p": 0.606557377,
225
+ "r": 0.578125,
226
+ "f": 0.592
227
  },
228
  "erased": {
229
  "p": 0.0,
 
231
  "f": 0.0
232
  },
233
  "etc": {
234
+ "p": 0.8941176471,
235
+ "r": 0.9047619048,
236
+ "f": 0.899408284
237
  }
238
  },
239
+ "ents_p": 0.7403037383,
240
+ "ents_r": 0.6963736264,
241
+ "ents_f": 0.7176670442,
242
  "ents_per_type": {
243
  "DATE": {
244
+ "p": 0.7652011225,
245
+ "r": 0.810703667,
246
+ "f": 0.7872954764
247
  },
248
  "GPE": {
249
+ "p": 0.7768142401,
250
+ "r": 0.8318670577,
251
+ "f": 0.8033986311
252
  },
253
  "ORDINAL": {
254
+ "p": 0.8705882353,
255
+ "r": 0.7789473684,
256
+ "f": 0.8222222222
257
  },
258
  "FAC": {
259
+ "p": 0.464516129,
260
  "r": 0.3870967742,
261
+ "f": 0.42228739
262
  },
263
  "ORG": {
264
+ "p": 0.7108042242,
265
+ "r": 0.6659056317,
266
+ "f": 0.6876227898
267
  },
268
  "LOC": {
269
+ "p": 0.5685618729,
270
+ "r": 0.4569892473,
271
+ "f": 0.5067064083
272
  },
273
  "QUANTITY": {
274
+ "p": 0.7663551402,
275
+ "r": 0.6074074074,
276
+ "f": 0.6776859504
277
  },
278
+ "PERSON": {
279
+ "p": 0.8168642951,
280
+ "r": 0.7989690722,
281
+ "f": 0.8078175896
282
  },
283
  "CARDINAL": {
284
+ "p": 0.6218097448,
285
+ "r": 0.5403225806,
286
+ "f": 0.5782092772
287
  },
288
  "NORP": {
289
+ "p": 0.701183432,
290
+ "r": 0.4978991597,
291
+ "f": 0.5823095823
292
  },
293
  "TIME": {
294
+ "p": 0.7427184466,
295
+ "r": 0.7427184466,
296
+ "f": 0.7427184466
297
+ },
298
+ "WORK_OF_ART": {
299
+ "p": 0.676056338,
300
+ "r": 0.32,
301
+ "f": 0.4343891403
302
  },
303
  "MONEY": {
304
+ "p": 0.9411764706,
305
+ "r": 0.8296296296,
306
+ "f": 0.8818897638
307
  },
308
  "EVENT": {
309
+ "p": 0.6082474227,
310
+ "r": 0.4338235294,
311
+ "f": 0.5064377682
 
 
 
 
 
312
  },
313
  "PERCENT": {
314
+ "p": 0.8095238095,
315
+ "r": 0.8192771084,
316
+ "f": 0.8143712575
317
  },
318
  "PRODUCT": {
319
  "p": 0.0,
 
321
  "f": 0.0
322
  },
323
  "LAW": {
324
+ "p": 0.4230769231,
325
+ "r": 0.1833333333,
326
+ "f": 0.2558139535
327
  },
328
  "LANGUAGE": {
329
+ "p": 0.6,
330
+ "r": 0.6666666667,
331
+ "f": 0.6315789474
332
  }
333
  },
334
+ "speed": 7558.2542061289
335
  }
attribute_ruler/patterns CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
 
config.cfg CHANGED
@@ -51,7 +51,7 @@ nO = null
51
  @architectures = "spacy.MultiHashEmbed.v2"
52
  width = 96
53
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
54
- rows = [5000,2500,2500,2500]
55
  include_static_vectors = true
56
 
57
  [components.ner.model.tok2vec.encode]
@@ -89,8 +89,9 @@ overwrite = false
89
  scorer = {"@scorers":"spacy.senter_scorer.v1"}
90
 
91
  [components.senter.model]
92
- @architectures = "spacy.Tagger.v1"
93
  nO = null
 
94
 
95
  [components.senter.model.tok2vec]
96
  @architectures = "spacy.Tok2Vec.v2"
@@ -111,12 +112,14 @@ maxout_pieces = 2
111
 
112
  [components.tagger]
113
  factory = "tagger"
 
114
  overwrite = false
115
  scorer = {"@scorers":"spacy.tagger_scorer.v1"}
116
 
117
  [components.tagger.model]
118
- @architectures = "spacy.Tagger.v1"
119
  nO = null
 
120
 
121
  [components.tagger.model.tok2vec]
122
  @architectures = "spacy.Tok2VecListener.v1"
@@ -133,7 +136,7 @@ factory = "tok2vec"
133
  @architectures = "spacy.MultiHashEmbed.v2"
134
  width = ${components.tok2vec.model.encode:width}
135
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
136
- rows = [5000,2500,2500,2500]
137
  include_static_vectors = true
138
 
139
  [components.tok2vec.model.encode]
@@ -170,7 +173,7 @@ dropout = 0.1
170
  accumulate_gradient = 1
171
  patience = 5000
172
  max_epochs = 0
173
- max_steps = 0
174
  eval_frequency = 1000
175
  frozen_components = []
176
  before_to_disk = null
 
51
  @architectures = "spacy.MultiHashEmbed.v2"
52
  width = 96
53
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
54
+ rows = [5000,1000,2500,2500]
55
  include_static_vectors = true
56
 
57
  [components.ner.model.tok2vec.encode]
 
89
  scorer = {"@scorers":"spacy.senter_scorer.v1"}
90
 
91
  [components.senter.model]
92
+ @architectures = "spacy.Tagger.v2"
93
  nO = null
94
+ normalize = false
95
 
96
  [components.senter.model.tok2vec]
97
  @architectures = "spacy.Tok2Vec.v2"
 
112
 
113
  [components.tagger]
114
  factory = "tagger"
115
+ neg_prefix = "!"
116
  overwrite = false
117
  scorer = {"@scorers":"spacy.tagger_scorer.v1"}
118
 
119
  [components.tagger.model]
120
+ @architectures = "spacy.Tagger.v2"
121
  nO = null
122
+ normalize = false
123
 
124
  [components.tagger.model.tok2vec]
125
  @architectures = "spacy.Tok2VecListener.v1"
 
136
  @architectures = "spacy.MultiHashEmbed.v2"
137
  width = ${components.tok2vec.model.encode:width}
138
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
139
+ rows = [5000,1000,2500,2500]
140
  include_static_vectors = true
141
 
142
  [components.tok2vec.model.encode]
 
173
  accumulate_gradient = 1
174
  patience = 5000
175
  max_epochs = 0
176
+ max_steps = 100000
177
  eval_frequency = 1000
178
  frozen_components = []
179
  before_to_disk = null
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"zh",
3
  "name":"core_web_lg",
4
- "version":"3.2.0",
5
  "description":"Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler.",
6
  "author":"Explosion",
7
  "email":"[email protected]",
8
  "url":"https://explosion.ai",
9
  "license":"MIT",
10
- "spacy_version":">=3.2.0,<3.3.0",
11
- "spacy_git_version":"bb26550e2",
12
  "vectors":{
13
  "width":300,
14
  "vectors":500000,
@@ -104,10 +104,6 @@
104
  "punct",
105
  "xcomp"
106
  ],
107
- "senter":[
108
- "I",
109
- "S"
110
- ],
111
  "attribute_ruler":[
112
 
113
  ],
@@ -155,207 +151,207 @@
155
  "token_p":0.9458325855,
156
  "token_r":0.9136060443,
157
  "token_f":0.9294400505,
158
- "tag_acc":0.9037457747,
159
- "sents_p":0.7896445968,
160
- "sents_r":0.7286499084,
161
- "sents_f":0.7579220779,
162
- "dep_uas":0.7069146954,
163
- "dep_las":0.6555390607,
164
  "dep_las_per_type":{
165
  "dep":{
166
- "p":0.4876810512,
167
- "r":0.3299989896,
168
- "f":0.3936362541
169
  },
170
  "case":{
171
- "p":0.8168795974,
172
- "r":0.7674587779,
173
- "f":0.7913983872
174
  },
175
  "nmod:tmod":{
176
- "p":0.7313237221,
177
- "r":0.7591836735,
178
- "f":0.7449933244
179
  },
180
  "nummod":{
181
- "p":0.8191268191,
182
- "r":0.5249833444,
183
- "f":0.6398700771
184
  },
185
  "mark:clf":{
186
- "p":0.9383017715,
187
- "r":0.572920552,
188
- "f":0.7114404817
189
  },
190
  "auxpass":{
191
- "p":0.8817204301,
192
- "r":0.8864864865,
193
- "f":0.884097035
194
  },
195
  "nsubj":{
196
- "p":0.7777050039,
197
- "r":0.7292715883,
198
- "f":0.7527099842
199
  },
200
  "acl":{
201
- "p":0.7153127247,
202
- "r":0.5518580144,
203
- "f":0.623043206
204
  },
205
  "advmod":{
206
- "p":0.8195641156,
207
- "r":0.7331670823,
208
- "f":0.7739619481
209
  },
210
  "mark":{
211
- "p":0.7456996746,
212
- "r":0.7028921998,
213
- "f":0.7236634333
214
  },
215
  "xcomp":{
216
- "p":0.7944444444,
217
- "r":0.6986970684,
218
- "f":0.7435008666
219
  },
220
  "nmod:assmod":{
221
- "p":0.7745130406,
222
- "r":0.7301587302,
223
- "f":0.7516821532
224
  },
225
  "det":{
226
- "p":0.8369132856,
227
- "r":0.6162858817,
228
- "f":0.709851552
229
  },
230
  "amod":{
231
- "p":0.7794589638,
232
- "r":0.6677140613,
233
- "f":0.7192722657
234
  },
235
  "nmod:prep":{
236
- "p":0.7016613644,
237
- "r":0.6004234725,
238
- "f":0.6471067645
239
  },
240
  "root":{
241
- "p":0.7394862036,
242
- "r":0.6469119361,
243
- "f":0.6901083289
244
  },
245
  "aux:prtmod":{
246
- "p":0.9246031746,
247
- "r":0.8321428571,
248
- "f":0.8759398496
249
  },
250
  "compound:nn":{
251
- "p":0.7463895738,
252
- "r":0.7170896785,
253
- "f":0.7314463238
254
  },
255
  "dobj":{
256
- "p":0.7939269334,
257
- "r":0.7435935417,
258
- "f":0.7679363622
259
  },
260
  "ccomp":{
261
- "p":0.6330907698,
262
- "r":0.6426905132,
263
- "f":0.6378545244
264
  },
265
  "advmod:rcomp":{
266
- "p":0.8229813665,
267
- "r":0.7340720222,
268
- "f":0.775988287
269
  },
270
  "nmod:topic":{
271
- "p":0.3762886598,
272
- "r":0.237012987,
273
- "f":0.2908366534
274
  },
275
  "cop":{
276
- "p":0.7518367347,
277
- "r":0.5926640927,
278
- "f":0.6628283555
279
  },
280
  "discourse":{
281
- "p":0.5575139147,
282
- "r":0.4958745875,
283
- "f":0.5248908297
284
  },
285
  "neg":{
286
- "p":0.8395802099,
287
- "r":0.6658739596,
288
- "f":0.7427055703
289
  },
290
  "aux:modal":{
291
- "p":0.8475289169,
292
- "r":0.8335056877,
293
- "f":0.8404588113
294
  },
295
  "nmod":{
296
- "p":0.7278688525,
297
- "r":0.6024423338,
298
- "f":0.6592427617
299
  },
300
  "aux:ba":{
301
- "p":0.807486631,
302
- "r":0.8031914894,
303
- "f":0.8053333333
304
  },
305
  "advmod:loc":{
306
- "p":0.6349206349,
307
- "r":0.4747774481,
308
- "f":0.5432937182
309
  },
310
  "aux:asp":{
311
- "p":0.9013854931,
312
- "r":0.8819776715,
313
- "f":0.8915759774
314
  },
315
  "conj":{
316
- "p":0.4869204402,
317
- "r":0.5102079395,
318
- "f":0.4982922551
319
  },
320
  "nsubjpass":{
321
- "p":0.8048780488,
322
- "r":0.66,
323
- "f":0.7252747253
324
  },
325
  "compound:vc":{
326
- "p":0.4647058824,
327
- "r":0.4093264249,
328
- "f":0.435261708
329
  },
330
  "advcl:loc":{
331
- "p":0.5573770492,
332
- "r":0.4857142857,
333
- "f":0.5190839695
334
  },
335
  "cc":{
336
- "p":0.7340425532,
337
- "r":0.6734693878,
338
- "f":0.7024525683
339
  },
340
  "advmod:dvp":{
341
- "p":0.8320610687,
342
- "r":0.6770186335,
343
- "f":0.7465753425
344
  },
345
  "appos":{
346
- "p":0.8740920097,
347
- "r":0.8298850575,
348
- "f":0.8514150943
349
  },
350
  "nmod:poss":{
351
- "p":0.7341772152,
352
- "r":0.4296296296,
353
- "f":0.5420560748
354
  },
355
  "name":{
356
- "p":0.6018518519,
357
- "r":0.4814814815,
358
- "f":0.5349794239
359
  },
360
  "nsubj:xsubj":{
361
  "p":0.0,
@@ -363,19 +359,19 @@
363
  "f":0.0
364
  },
365
  "nmod:range":{
366
- "p":0.7035714286,
367
- "r":0.6610738255,
368
- "f":0.6816608997
369
  },
370
  "parataxis:prnmod":{
371
- "p":0.5454545455,
372
- "r":0.1353383459,
373
- "f":0.2168674699
374
  },
375
  "amod:ordmod":{
376
- "p":0.564516129,
377
- "r":0.546875,
378
- "f":0.5555555556
379
  },
380
  "erased":{
381
  "p":0.0,
@@ -383,89 +379,89 @@
383
  "f":0.0
384
  },
385
  "etc":{
386
- "p":0.9069767442,
387
- "r":0.9285714286,
388
- "f":0.9176470588
389
  }
390
  },
391
- "ents_p":0.7358998362,
392
- "ents_r":0.6910989011,
393
- "ents_f":0.7127961011,
394
  "ents_per_type":{
395
  "DATE":{
396
- "p":0.7675925926,
397
- "r":0.82160555,
398
- "f":0.7936811872
399
  },
400
  "GPE":{
401
- "p":0.7719060524,
402
- "r":0.8352883675,
403
- "f":0.8023474178
404
  },
405
  "ORDINAL":{
406
- "p":0.8388888889,
407
- "r":0.7947368421,
408
- "f":0.8162162162
409
  },
410
  "FAC":{
411
- "p":0.5581395349,
412
  "r":0.3870967742,
413
- "f":0.4571428571
414
  },
415
  "ORG":{
416
- "p":0.7028571429,
417
- "r":0.6552511416,
418
- "f":0.6782197716
419
  },
420
  "LOC":{
421
- "p":0.5894039735,
422
- "r":0.4784946237,
423
- "f":0.528189911
424
  },
425
  "QUANTITY":{
426
- "p":0.7889908257,
427
- "r":0.637037037,
428
- "f":0.7049180328
429
  },
430
- "WORK_OF_ART":{
431
- "p":0.5,
432
- "r":0.2866666667,
433
- "f":0.3644067797
434
  },
435
  "CARDINAL":{
436
- "p":0.614744352,
437
- "r":0.5211693548,
438
- "f":0.5641025641
439
  },
440
  "NORP":{
441
- "p":0.6755952381,
442
- "r":0.4768907563,
443
- "f":0.5591133005
444
  },
445
  "TIME":{
446
- "p":0.7365853659,
447
- "r":0.7330097087,
448
- "f":0.7347931873
 
 
 
 
 
449
  },
450
  "MONEY":{
451
- "p":0.9322033898,
452
- "r":0.8148148148,
453
- "f":0.8695652174
454
  },
455
  "EVENT":{
456
- "p":0.5681818182,
457
- "r":0.3676470588,
458
- "f":0.4464285714
459
- },
460
- "PERSON":{
461
- "p":0.8077682686,
462
- "r":0.7905927835,
463
- "f":0.7990882449
464
  },
465
  "PERCENT":{
466
- "p":0.7882352941,
467
- "r":0.8072289157,
468
- "f":0.7976190476
469
  },
470
  "PRODUCT":{
471
  "p":0.0,
@@ -473,17 +469,17 @@
473
  "f":0.0
474
  },
475
  "LAW":{
476
- "p":0.3333333333,
477
- "r":0.1,
478
- "f":0.1538461538
479
  },
480
  "LANGUAGE":{
481
- "p":0.5555555556,
482
- "r":0.5555555556,
483
- "f":0.5555555556
484
  }
485
  },
486
- "speed":7127.6040150529
487
  },
488
  "sources":[
489
  {
 
1
  {
2
  "lang":"zh",
3
  "name":"core_web_lg",
4
+ "version":"3.3.0",
5
  "description":"Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler.",
6
  "author":"Explosion",
7
  "email":"[email protected]",
8
  "url":"https://explosion.ai",
9
  "license":"MIT",
10
+ "spacy_version":">=3.3.0.dev0,<3.4.0",
11
+ "spacy_git_version":"849bef2de",
12
  "vectors":{
13
  "width":300,
14
  "vectors":500000,
 
104
  "punct",
105
  "xcomp"
106
  ],
 
 
 
 
107
  "attribute_ruler":[
108
 
109
  ],
 
151
  "token_p":0.9458325855,
152
  "token_r":0.9136060443,
153
  "token_f":0.9294400505,
154
+ "tag_acc":0.903399232,
155
+ "sents_p":0.7851653262,
156
+ "sents_r":0.7313134676,
157
+ "sents_f":0.757283227,
158
+ "dep_uas":0.708630098,
159
+ "dep_las":0.6570108094,
160
  "dep_las_per_type":{
161
  "dep":{
162
+ "p":0.4873308379,
163
+ "r":0.3420228352,
164
+ "f":0.4019473965
165
  },
166
  "case":{
167
+ "p":0.8121243126,
168
+ "r":0.7698836081,
169
+ "f":0.7904400324
170
  },
171
  "nmod:tmod":{
172
+ "p":0.7419786096,
173
+ "r":0.7551020408,
174
+ "f":0.7484828051
175
  },
176
  "nummod":{
177
+ "p":0.8179043744,
178
+ "r":0.5356429047,
179
+ "f":0.6473429952
180
  },
181
  "mark:clf":{
182
+ "p":0.9362745098,
183
+ "r":0.5699365908,
184
+ "f":0.7085555298
185
  },
186
  "auxpass":{
187
+ "p":0.8617021277,
188
+ "r":0.8756756757,
189
+ "f":0.8686327078
190
  },
191
  "nsubj":{
192
+ "p":0.7863859092,
193
+ "r":0.7293944233,
194
+ "f":0.7568187612
195
  },
196
  "acl":{
197
+ "p":0.6861842105,
198
+ "r":0.5784803106,
199
+ "f":0.6277460126
200
  },
201
  "advmod":{
202
+ "p":0.8230973788,
203
+ "r":0.7367943777,
204
+ "f":0.7775584664
205
  },
206
  "mark":{
207
+ "p":0.7435536803,
208
+ "r":0.6950043821,
209
+ "f":0.7184597961
210
  },
211
  "xcomp":{
212
+ "p":0.7836363636,
213
+ "r":0.7019543974,
214
+ "f":0.7405498282
215
  },
216
  "nmod:assmod":{
217
+ "p":0.763022508,
218
+ "r":0.7385620915,
219
+ "f":0.7505930729
220
  },
221
  "det":{
222
+ "p":0.8353317346,
223
+ "r":0.6121851201,
224
+ "f":0.7065584855
225
  },
226
  "amod":{
227
+ "p":0.7771274201,
228
+ "r":0.6779261587,
229
+ "f":0.7241451647
230
  },
231
  "nmod:prep":{
232
+ "p":0.6958174905,
233
+ "r":0.608892922,
234
+ "f":0.6494595903
235
  },
236
  "root":{
237
+ "p":0.74281935,
238
+ "r":0.6544031963,
239
+ "f":0.6958137888
240
  },
241
  "aux:prtmod":{
242
+ "p":0.8976377953,
243
+ "r":0.8142857143,
244
+ "f":0.8539325843
245
  },
246
  "compound:nn":{
247
+ "p":0.7375549692,
248
+ "r":0.7094754653,
249
+ "f":0.7232427771
250
  },
251
  "dobj":{
252
+ "p":0.8049725541,
253
+ "r":0.7385572508,
254
+ "f":0.7703360371
255
  },
256
  "ccomp":{
257
+ "p":0.65,
258
+ "r":0.6318040435,
259
+ "f":0.6407728707
260
  },
261
  "advmod:rcomp":{
262
+ "p":0.8198757764,
263
+ "r":0.7313019391,
264
+ "f":0.7730600293
265
  },
266
  "nmod:topic":{
267
+ "p":0.3668122271,
268
+ "r":0.2727272727,
269
+ "f":0.312849162
270
  },
271
  "cop":{
272
+ "p":0.7520325203,
273
+ "r":0.5952380952,
274
+ "f":0.6645114943
275
  },
276
  "discourse":{
277
+ "p":0.572761194,
278
+ "r":0.5066006601,
279
+ "f":0.5376532399
280
  },
281
  "neg":{
282
+ "p":0.8438438438,
283
+ "r":0.6682520809,
284
+ "f":0.7458526875
285
  },
286
  "aux:modal":{
287
+ "p":0.862911796,
288
+ "r":0.8397104447,
289
+ "f":0.8511530398
290
  },
291
  "nmod":{
292
+ "p":0.7196850394,
293
+ "r":0.6200814111,
294
+ "f":0.666180758
295
  },
296
  "aux:ba":{
297
+ "p":0.8202247191,
298
+ "r":0.7765957447,
299
+ "f":0.7978142077
300
  },
301
  "advmod:loc":{
302
+ "p":0.6396761134,
303
+ "r":0.46884273,
304
+ "f":0.5410958904
305
  },
306
  "aux:asp":{
307
+ "p":0.9109816972,
308
+ "r":0.8732057416,
309
+ "f":0.8916938111
310
  },
311
  "conj":{
312
+ "p":0.5052874447,
313
+ "r":0.4967863894,
314
+ "f":0.5010008579
315
  },
316
  "nsubjpass":{
317
+ "p":0.8205128205,
318
+ "r":0.64,
319
+ "f":0.7191011236
320
  },
321
  "compound:vc":{
322
+ "p":0.4213483146,
323
+ "r":0.3886010363,
324
+ "f":0.4043126685
325
  },
326
  "advcl:loc":{
327
+ "p":0.5267175573,
328
+ "r":0.4928571429,
329
+ "f":0.5092250923
330
  },
331
  "cc":{
332
+ "p":0.7130600572,
333
+ "r":0.6637089618,
334
+ "f":0.6875
335
  },
336
  "advmod:dvp":{
337
+ "p":0.8536585366,
338
+ "r":0.652173913,
339
+ "f":0.7394366197
340
  },
341
  "appos":{
342
+ "p":0.8705035971,
343
+ "r":0.8344827586,
344
+ "f":0.8521126761
345
  },
346
  "nmod:poss":{
347
+ "p":0.6947368421,
348
+ "r":0.4888888889,
349
+ "f":0.5739130435
350
  },
351
  "name":{
352
+ "p":0.71,
353
+ "r":0.5259259259,
354
+ "f":0.6042553191
355
  },
356
  "nsubj:xsubj":{
357
  "p":0.0,
 
359
  "f":0.0
360
  },
361
  "nmod:range":{
362
+ "p":0.7354085603,
363
+ "r":0.6342281879,
364
+ "f":0.6810810811
365
  },
366
  "parataxis:prnmod":{
367
+ "p":0.4193548387,
368
+ "r":0.0977443609,
369
+ "f":0.1585365854
370
  },
371
  "amod:ordmod":{
372
+ "p":0.606557377,
373
+ "r":0.578125,
374
+ "f":0.592
375
  },
376
  "erased":{
377
  "p":0.0,
 
379
  "f":0.0
380
  },
381
  "etc":{
382
+ "p":0.8941176471,
383
+ "r":0.9047619048,
384
+ "f":0.899408284
385
  }
386
  },
387
+ "ents_p":0.7403037383,
388
+ "ents_r":0.6963736264,
389
+ "ents_f":0.7176670442,
390
  "ents_per_type":{
391
  "DATE":{
392
+ "p":0.7652011225,
393
+ "r":0.810703667,
394
+ "f":0.7872954764
395
  },
396
  "GPE":{
397
+ "p":0.7768142401,
398
+ "r":0.8318670577,
399
+ "f":0.8033986311
400
  },
401
  "ORDINAL":{
402
+ "p":0.8705882353,
403
+ "r":0.7789473684,
404
+ "f":0.8222222222
405
  },
406
  "FAC":{
407
+ "p":0.464516129,
408
  "r":0.3870967742,
409
+ "f":0.42228739
410
  },
411
  "ORG":{
412
+ "p":0.7108042242,
413
+ "r":0.6659056317,
414
+ "f":0.6876227898
415
  },
416
  "LOC":{
417
+ "p":0.5685618729,
418
+ "r":0.4569892473,
419
+ "f":0.5067064083
420
  },
421
  "QUANTITY":{
422
+ "p":0.7663551402,
423
+ "r":0.6074074074,
424
+ "f":0.6776859504
425
  },
426
+ "PERSON":{
427
+ "p":0.8168642951,
428
+ "r":0.7989690722,
429
+ "f":0.8078175896
430
  },
431
  "CARDINAL":{
432
+ "p":0.6218097448,
433
+ "r":0.5403225806,
434
+ "f":0.5782092772
435
  },
436
  "NORP":{
437
+ "p":0.701183432,
438
+ "r":0.4978991597,
439
+ "f":0.5823095823
440
  },
441
  "TIME":{
442
+ "p":0.7427184466,
443
+ "r":0.7427184466,
444
+ "f":0.7427184466
445
+ },
446
+ "WORK_OF_ART":{
447
+ "p":0.676056338,
448
+ "r":0.32,
449
+ "f":0.4343891403
450
  },
451
  "MONEY":{
452
+ "p":0.9411764706,
453
+ "r":0.8296296296,
454
+ "f":0.8818897638
455
  },
456
  "EVENT":{
457
+ "p":0.6082474227,
458
+ "r":0.4338235294,
459
+ "f":0.5064377682
 
 
 
 
 
460
  },
461
  "PERCENT":{
462
+ "p":0.8095238095,
463
+ "r":0.8192771084,
464
+ "f":0.8143712575
465
  },
466
  "PRODUCT":{
467
  "p":0.0,
 
469
  "f":0.0
470
  },
471
  "LAW":{
472
+ "p":0.4230769231,
473
+ "r":0.1833333333,
474
+ "f":0.2558139535
475
  },
476
  "LANGUAGE":{
477
+ "p":0.6,
478
+ "r":0.6666666667,
479
+ "f":0.6315789474
480
  }
481
  },
482
+ "speed":7558.2542061289
483
  },
484
  "sources":[
485
  {
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:086f9e93f467aff3c67d56dd981b6cd71a5236018fec71a91834ab4065ccb2d5
3
- size 6956943
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:36df6f768da2d9216c002d79f0f1d2f116970b291aed104db473d6908881f784
3
+ size 6380943
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9eafa2b12e46da1a900ca46fe526a9a62064d789670c51b1cfc3689f426efbae
3
  size 308728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:47b64febd3e2df5aa1608c3d8bbe20165c6df3399775e57d2402a67ed416f98c
3
  size 308728
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves��{"0":{"":406716},"1":{"":267231},"2":{"advmod":56960,"nsubj":53520,"compound:nn":43919,"dep":40111,"punct":36035,"case":23986,"nmod:assmod":21599,"nmod:prep":20098,"amod":16922,"acl":11979,"conj":10687,"cop":7238,"det":7210,"nummod":6994,"cc":6235,"aux:modal":5566,"nmod:tmod":5335,"nmod":4915,"neg":4363,"xcomp":3881,"appos":2955,"nmod:topic":2410,"discourse":2163,"advmod:loc":1591,"aux:prtmod":1539,"aux:ba":1311,"auxpass":1220,"advmod:dvp":1142,"advcl:loc":1046,"name":1032,"compound:vc":830,"nmod:poss":560,"amod:ordmod":511,"dobj":406,"nsubjpass":263,"nsubj:xsubj||ccomp":62,"parataxis:prnmod":34,"nsubj:xsubj":32},"3":{"punct":74006,"dobj":45383,"conj":30040,"case":30024,"dep":18660,"ccomp":17216,"mark":16600,"mark:clf":11551,"aux:asp":7896,"discourse":3998,"advmod:rcomp":2387,"nmod:range":1885,"cc":1675,"nmod:prep":1595,"advmod":1116,"etc":941,"compound:vc":790,"parataxis:prnmod":693,"advmod:loc":522,"neg":69,"advcl:loc":39,"acl":39},"4":{"ROOT":34525}}�cfg��neg_key�
 
1
+ ��moves��{"0":{"":436297},"1":{"":282750},"2":{"advmod":61142,"nsubj":55539,"compound:nn":45994,"dep":43937,"punct":36396,"case":24751,"nmod:assmod":22308,"nmod:prep":21037,"amod":18609,"acl":12438,"conj":10993,"det":10371,"nummod":9922,"cop":9515,"cc":6289,"aux:modal":6003,"neg":5955,"nmod:tmod":5338,"nmod":5049,"xcomp":4333,"appos":2988,"nmod:topic":2532,"discourse":2283,"advmod:loc":1902,"aux:prtmod":1724,"aux:ba":1323,"auxpass":1240,"advmod:dvp":1193,"name":1117,"advcl:loc":1072,"compound:vc":834,"nmod:poss":657,"amod:ordmod":601,"dobj":441,"nsubjpass":276,"nsubj:xsubj||ccomp":64,"parataxis:prnmod":36,"nsubj:xsubj":32},"3":{"punct":74587,"dobj":46958,"conj":31352,"case":31222,"dep":20953,"mark:clf":18377,"ccomp":17748,"mark":16793,"aux:asp":8130,"discourse":4187,"advmod:rcomp":2519,"nmod:range":2021,"cc":1715,"nmod:prep":1690,"advmod":1162,"etc":943,"compound:vc":828,"parataxis:prnmod":724,"advmod:loc":571,"neg":70,"acl":43,"advcl:loc":42},"4":{"ROOT":36097}}�cfg��neg_key�
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2ac976c39a3cee55c41ee08d2824e3f25d2b942c6aed9d5c4058c2c1c5224e1a
3
- size 213211
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:107f1c9c7b52352006c6c3846741d048d8de965662e6f735a8a03e2f76251c8f
3
+ size 213263
tagger/cfg CHANGED
@@ -37,5 +37,6 @@
37
  "VV",
38
  "X"
39
  ],
 
40
  "overwrite":false
41
  }
 
37
  "VV",
38
  "X"
39
  ],
40
+ "neg_prefix":"!",
41
  "overwrite":false
42
  }
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:30ff3087e95e06dd4c7d3002a6193c977abe75b8d32b6220aa4cd8495d72da3b
3
- size 14345
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0f36677a26354876d001dcf945d2b228e1869ca1fdade6a6ddb80ae933b99bbe
3
+ size 14397
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4e713e99fe6b9c9edf1e3e2f66fd840375ce3ea6db8d8763f290d17d17434ba2
3
- size 6811418
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3bb3a79f2321e4f83c7f0838a808678175c5f74c329d2223a0e03c77e2d3e114
3
+ size 6235418
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9860bff8f8b50d10c77f43b97e932359ecb16be487fab650fd5e7ae3895101fc
3
- size 10513704
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4f13a0f49d0066b44d02ec1356ec80b6a93560a7be4138dc1c533945094dadf3
3
+ size 10514537
zh_core_web_lg-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0ad7469433d4402b3d24083af28f41c8b1f7da5cd016146a843b7c35efc4745f
3
- size 603932201
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c8e4d6b436c5c5dbec0b999dd843bbc41fb20c9080f64bfc86c12032268bdb54
3
+ size 602867181