File size: 6,818 Bytes
e13be7b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
0855f19
b3204c0
 
 
 
 
 
0855f19
b3204c0
 
 
 
 
 
0855f19
e13be7b
 
 
 
 
 
0855f19
e13be7b
 
 
 
 
 
0855f19
e13be7b
 
 
 
 
 
0855f19
e13be7b
 
 
 
 
 
0855f19
e13be7b
 
 
 
 
 
b3204c0
 
e13be7b
 
 
 
 
 
 
 
 
b3204c0
e13be7b
 
 
 
 
b3204c0
e13be7b
 
 
 
 
 
 
0855f19
 
 
 
 
 
 
 
 
 
 
 
 
ac6814f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
---
tags:
- spacy
- token-classification
language:
- multilingual
model-index:
- name: xx_fro_sigtyp_trf
  results:
  - task:
      name: TAG
      type: token-classification
    metrics:
    - name: TAG (XPOS) Accuracy
      type: accuracy
      value: 0.8910235177
  - task:
      name: POS
      type: token-classification
    metrics:
    - name: POS (UPOS) Accuracy
      type: accuracy
      value: 0.890459364
  - task:
      name: MORPH
      type: token-classification
    metrics:
    - name: Morph (UFeats) Accuracy
      type: accuracy
      value: 0.9118816254
  - task:
      name: LEMMA
      type: token-classification
    metrics:
    - name: Lemma Accuracy
      type: accuracy
      value: 0.8443364981
  - task:
      name: UNLABELED_DEPENDENCIES
      type: token-classification
    metrics:
    - name: Unlabeled Attachment Score (UAS)
      type: f_score
      value: 0.7518266566
  - task:
      name: LABELED_DEPENDENCIES
      type: token-classification
    metrics:
    - name: Labeled Attachment Score (LAS)
      type: f_score
      value: 0.6812799194
  - task:
      name: SENTS
      type: token-classification
    metrics:
    - name: Sentences F-Score
      type: f_score
      value: 0.9002493766
---
| Feature | Description |
| --- | --- |
| **Name** | `xx_fro_sigtyp_trf` |
| **Version** | `0.1.0` |
| **spaCy** | `>=3.6.1,<3.7.0` |
| **Default Pipeline** | `transformer`, `parser`, `trainable_lemmatizer`, `tagger`, `morphologizer` |
| **Components** | `transformer`, `parser`, `trainable_lemmatizer`, `tagger`, `morphologizer` |
| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
| **Sources** | n/a |
| **License** | n/a |
| **Author** | [n/a]() |

### Label Scheme

<details>

<summary>View label scheme (190 labels for 3 components)</summary>

| Component | Labels |
| --- | --- |
| **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `case:det`, `cc`, `cc:nc`, `ccomp`, `conj`, `cop`, `csubj`, `dep`, `det`, `expl`, `flat`, `iobj`, `mark`, `nmod`, `nsubj`, `nummod`, `obj`, `obl`, `obl:advmod`, `parataxis`, `punct`, `vocative`, `xcomp` |
| **`tagger`** | `ADJcar__NumType=Card`, `ADJind__PronType=Ind`, `ADJord`, `ADJpos__Poss=Yes`, `ADJqua`, `ADJqua__Tense=Past\|VerbForm=Part`, `ADJqua__Tense=Pres\|VerbForm=Part`, `ADVgen`, `ADVgen.PROper`, `ADVgen__PronType=Ind`, `ADVgen__PronType=Prs,Rel`, `ADVgen__PronType=Rel`, `ADVint__PronType=Int`, `ADVneg`, `ADVneg.PROper__Polarity=Neg\|PronType=Prs`, `ADVneg__Polarity=Neg`, `ADVsub`, `CONcoo`, `CONcoo__PronType=Prs,Rel`, `CONsub`, `CONsub.PROper`, `CONsub__PronType=Rel`, `DETcar__NumType=Card`, `DETdef__Definite=Def`, `DETdef__Definite=Def\|PronType=Art`, `DETdef__PronType=Ind`, `DETdef__PronType=Prs`, `DETdem__PronType=Dem`, `DETdem__PronType=Prs`, `DETind__PronType=Ind`, `DETint__PronType=Int`, `DETndf__Definite=Ind`, `DETndf__Definite=Ind\|PronType=Art`, `DETord`, `DETord__NumType=Ord`, `DETpos__Poss=Yes`, `DETrel__PronType=Rel`, `INJ`, `NOMcom`, `NOMcom__Morph=VFin`, `NOMcom__VerbForm=Inf`, `NOMpro`, `PONfbl`, `PONfrt`, `PONpdr`, `PONpga`, `PONpxx`, `PRE`, `PRE.DETdef__Definite=Def\|PronType=Art`, `PRE.PROper`, `PRE.PROper__Definite=Def\|PronType=Art`, `PRE__Morph=VFin`, `PRE__PronType=Dem`, `PREdetdef__PronType=Prs,Rel`, `PROadv`, `PROadv__PronType=Dem`, `PROcar`, `PROcar__NumType=Card`, `PROdem__PronType=Dem`, `PROdem__PronType=Prs,Rel`, `PROimp`, `PROimp__PronType=Prs`, `PROind`, `PROind__PronType=Ind`, `PROind__PronType=Rel`, `PROint__PronType=Int`, `PROord__NumType=Ord`, `PROper`, `PROper.PROper__PronType=Prs`, `PROper__Poss=Yes`, `PROper__PronType=Prs`, `PROpos__Poss=Yes`, `PROpos__Poss=Yes\|PronType=Prs`, `PROrel`, `PROrel__PronType=Prs,Rel`, `PROrel__PronType=Rel`, `RED`, `VERcjg`, `VERcjg__VerbForm=Fin`, `VERcjg__VerbForm=Inf`, `VERinf__VerbForm=Inf`, `VERppa__Tense=Pres\|VerbForm=Part`, `VERppe`, `VERppe__Tense=Past`, `VERppe__Tense=Past\|VerbForm=Part`, `devenir__Tense=Past\|VerbForm=Part`, `devenir__VerbForm=Fin`, `laisser__VerbForm=Fin`, `remanoir__VerbForm=Fin`, `ressembler__VerbForm=Fin`, `sembler__VerbForm=Fin` |
| **`morphologizer`** | `POS=ADV`, `POS=PRON\|PronType=Prs`, `POS=ADV\|PronType=Dem`, `POS=VERB\|VerbForm=Fin`, `POS=VERB\|Tense=Pres\|VerbForm=Part`, `POS=PUNCT`, `POS=CCONJ`, `Definite=Def\|POS=DET\|PronType=Art`, `POS=NOUN`, `POS=DET\|PronType=Ind`, `POS=SCONJ`, `Definite=Def\|POS=ADP\|PronType=Art`, `NumType=Card\|POS=PRON`, `POS=DET\|Poss=Yes`, `POS=AUX\|VerbForm=Fin`, `POS=VERB\|VerbForm=Inf`, `POS=DET\|PronType=Rel`, `POS=PRON\|PronType=Prs,Rel`, `POS=ADP`, `POS=ADJ`, `POS=PROPN`, `POS=PRON\|PronType=Dem`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=PRON\|PronType=Ind`, `POS=ADV\|Polarity=Neg`, `NumType=Card\|POS=NUM`, `POS=AUX\|VerbForm=Inf`, `Definite=Ind\|POS=DET\|PronType=Art`, `POS=ADV\|PronType=Ind`, `POS=ADJ\|PronType=Ind`, `POS=DET\|PronType=Dem`, `POS=INTJ`, `POS=ADJ\|Poss=Yes`, `POS=ADV\|PronType=Int`, `POS=PRON`, `NumType=Ord\|POS=PRON`, `POS=VERB`, `POS=ADJ\|Tense=Past\|VerbForm=Part`, `POS=PRON\|PronType=Int`, `POS=SCONJ\|PronType=Prs,Rel`, `POS=PRON\|Polarity=Neg\|PronType=Prs`, `POS=SCONJ\|PronType=Rel`, `POS=PRON\|Poss=Yes\|PronType=Prs`, `NumType=Card\|POS=DET`, `POS=NUM`, `POS=DET\|PronType=Prs`, `NumType=Card\|POS=ADJ`, `NumType=Ord\|POS=DET`, `POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=CCONJ\|PronType=Prs,Rel`, `Morph=VFin\|POS=ADP`, `POS=DET\|PronType=Int`, `POS=ADJ\|Tense=Pres\|VerbForm=Part`, `Morph=VFin\|POS=NOUN`, `POS=PRON\|Poss=Yes`, `POS=AUX`, `POS=ADV\|PronType=Rel`, `POS=PRON\|PronType=Rel`, `POS=SCONJ\|PronType=Prs`, `POS=ADP\|PronType=Prs,Rel`, `POS=NOUN\|VerbForm=Inf`, `Definite=Def\|POS=DET`, `POS=VERB\|Tense=Past`, `Definite=Ind\|POS=DET`, `POS=ADP\|PronType=Dem`, `POS=ADV\|PronType=Prs,Rel` |

</details>

### Accuracy

| Type | Score |
| --- | --- |
| `DEP_UAS` | 75.18 |
| `DEP_LAS` | 68.13 |
| `SENTS_P` | 87.41 |
| `SENTS_R` | 92.80 |
| `SENTS_F` | 90.02 |
| `LEMMA_ACC` | 84.43 |
| `TAG_ACC` | 89.10 |
| `POS_ACC` | 89.05 |
| `MORPH_ACC` | 91.19 |
| `TRANSFORMER_LOSS` | 130913.68 |
| `PARSER_LOSS` | 16324.89 |
| `TRAINABLE_LEMMATIZER_LOSS` | 904.27 |
| `TAGGER_LOSS` | 4331.12 |
| `MORPHOLOGIZER_LOSS` | 4719.16 |


### Citation

If you're using this model, please cite:

```
@inproceedings{miranda-2024-allen,
    title = "{A}llen Institute for {AI} @ {SIGTYP} 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages",
    author = "Miranda, Lester James",
    booktitle = "Proceedings of the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP",
    month = mar,
    year = "2024",
    address = "St. Julian's, Malta",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.sigtyp-1.18",
    pages = "151--159",
}
```