|
--- |
|
tags: |
|
- spacy |
|
- token-classification |
|
language: |
|
- multilingual |
|
model-index: |
|
- name: xx_fro_sigtyp_trf |
|
results: |
|
- task: |
|
name: TAG |
|
type: token-classification |
|
metrics: |
|
- name: TAG (XPOS) Accuracy |
|
type: accuracy |
|
value: 0.8910235177 |
|
- task: |
|
name: POS |
|
type: token-classification |
|
metrics: |
|
- name: POS (UPOS) Accuracy |
|
type: accuracy |
|
value: 0.890459364 |
|
- task: |
|
name: MORPH |
|
type: token-classification |
|
metrics: |
|
- name: Morph (UFeats) Accuracy |
|
type: accuracy |
|
value: 0.9118816254 |
|
- task: |
|
name: LEMMA |
|
type: token-classification |
|
metrics: |
|
- name: Lemma Accuracy |
|
type: accuracy |
|
value: 0.8443364981 |
|
- task: |
|
name: UNLABELED_DEPENDENCIES |
|
type: token-classification |
|
metrics: |
|
- name: Unlabeled Attachment Score (UAS) |
|
type: f_score |
|
value: 0.7518266566 |
|
- task: |
|
name: LABELED_DEPENDENCIES |
|
type: token-classification |
|
metrics: |
|
- name: Labeled Attachment Score (LAS) |
|
type: f_score |
|
value: 0.6812799194 |
|
- task: |
|
name: SENTS |
|
type: token-classification |
|
metrics: |
|
- name: Sentences F-Score |
|
type: f_score |
|
value: 0.9002493766 |
|
--- |
|
| Feature | Description | |
|
| --- | --- | |
|
| **Name** | `xx_fro_sigtyp_trf` | |
|
| **Version** | `0.1.0` | |
|
| **spaCy** | `>=3.6.1,<3.7.0` | |
|
| **Default Pipeline** | `transformer`, `parser`, `trainable_lemmatizer`, `tagger`, `morphologizer` | |
|
| **Components** | `transformer`, `parser`, `trainable_lemmatizer`, `tagger`, `morphologizer` | |
|
| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) | |
|
| **Sources** | n/a | |
|
| **License** | n/a | |
|
| **Author** | [n/a]() | |
|
|
|
### Label Scheme |
|
|
|
<details> |
|
|
|
<summary>View label scheme (190 labels for 3 components)</summary> |
|
|
|
| Component | Labels | |
|
| --- | --- | |
|
| **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `case:det`, `cc`, `cc:nc`, `ccomp`, `conj`, `cop`, `csubj`, `dep`, `det`, `expl`, `flat`, `iobj`, `mark`, `nmod`, `nsubj`, `nummod`, `obj`, `obl`, `obl:advmod`, `parataxis`, `punct`, `vocative`, `xcomp` | |
|
| **`tagger`** | `ADJcar__NumType=Card`, `ADJind__PronType=Ind`, `ADJord`, `ADJpos__Poss=Yes`, `ADJqua`, `ADJqua__Tense=Past\|VerbForm=Part`, `ADJqua__Tense=Pres\|VerbForm=Part`, `ADVgen`, `ADVgen.PROper`, `ADVgen__PronType=Ind`, `ADVgen__PronType=Prs,Rel`, `ADVgen__PronType=Rel`, `ADVint__PronType=Int`, `ADVneg`, `ADVneg.PROper__Polarity=Neg\|PronType=Prs`, `ADVneg__Polarity=Neg`, `ADVsub`, `CONcoo`, `CONcoo__PronType=Prs,Rel`, `CONsub`, `CONsub.PROper`, `CONsub__PronType=Rel`, `DETcar__NumType=Card`, `DETdef__Definite=Def`, `DETdef__Definite=Def\|PronType=Art`, `DETdef__PronType=Ind`, `DETdef__PronType=Prs`, `DETdem__PronType=Dem`, `DETdem__PronType=Prs`, `DETind__PronType=Ind`, `DETint__PronType=Int`, `DETndf__Definite=Ind`, `DETndf__Definite=Ind\|PronType=Art`, `DETord`, `DETord__NumType=Ord`, `DETpos__Poss=Yes`, `DETrel__PronType=Rel`, `INJ`, `NOMcom`, `NOMcom__Morph=VFin`, `NOMcom__VerbForm=Inf`, `NOMpro`, `PONfbl`, `PONfrt`, `PONpdr`, `PONpga`, `PONpxx`, `PRE`, `PRE.DETdef__Definite=Def\|PronType=Art`, `PRE.PROper`, `PRE.PROper__Definite=Def\|PronType=Art`, `PRE__Morph=VFin`, `PRE__PronType=Dem`, `PREdetdef__PronType=Prs,Rel`, `PROadv`, `PROadv__PronType=Dem`, `PROcar`, `PROcar__NumType=Card`, `PROdem__PronType=Dem`, `PROdem__PronType=Prs,Rel`, `PROimp`, `PROimp__PronType=Prs`, `PROind`, `PROind__PronType=Ind`, `PROind__PronType=Rel`, `PROint__PronType=Int`, `PROord__NumType=Ord`, `PROper`, `PROper.PROper__PronType=Prs`, `PROper__Poss=Yes`, `PROper__PronType=Prs`, `PROpos__Poss=Yes`, `PROpos__Poss=Yes\|PronType=Prs`, `PROrel`, `PROrel__PronType=Prs,Rel`, `PROrel__PronType=Rel`, `RED`, `VERcjg`, `VERcjg__VerbForm=Fin`, `VERcjg__VerbForm=Inf`, `VERinf__VerbForm=Inf`, `VERppa__Tense=Pres\|VerbForm=Part`, `VERppe`, `VERppe__Tense=Past`, `VERppe__Tense=Past\|VerbForm=Part`, `devenir__Tense=Past\|VerbForm=Part`, `devenir__VerbForm=Fin`, `laisser__VerbForm=Fin`, `remanoir__VerbForm=Fin`, `ressembler__VerbForm=Fin`, `sembler__VerbForm=Fin` | |
|
| **`morphologizer`** | `POS=ADV`, `POS=PRON\|PronType=Prs`, `POS=ADV\|PronType=Dem`, `POS=VERB\|VerbForm=Fin`, `POS=VERB\|Tense=Pres\|VerbForm=Part`, `POS=PUNCT`, `POS=CCONJ`, `Definite=Def\|POS=DET\|PronType=Art`, `POS=NOUN`, `POS=DET\|PronType=Ind`, `POS=SCONJ`, `Definite=Def\|POS=ADP\|PronType=Art`, `NumType=Card\|POS=PRON`, `POS=DET\|Poss=Yes`, `POS=AUX\|VerbForm=Fin`, `POS=VERB\|VerbForm=Inf`, `POS=DET\|PronType=Rel`, `POS=PRON\|PronType=Prs,Rel`, `POS=ADP`, `POS=ADJ`, `POS=PROPN`, `POS=PRON\|PronType=Dem`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=PRON\|PronType=Ind`, `POS=ADV\|Polarity=Neg`, `NumType=Card\|POS=NUM`, `POS=AUX\|VerbForm=Inf`, `Definite=Ind\|POS=DET\|PronType=Art`, `POS=ADV\|PronType=Ind`, `POS=ADJ\|PronType=Ind`, `POS=DET\|PronType=Dem`, `POS=INTJ`, `POS=ADJ\|Poss=Yes`, `POS=ADV\|PronType=Int`, `POS=PRON`, `NumType=Ord\|POS=PRON`, `POS=VERB`, `POS=ADJ\|Tense=Past\|VerbForm=Part`, `POS=PRON\|PronType=Int`, `POS=SCONJ\|PronType=Prs,Rel`, `POS=PRON\|Polarity=Neg\|PronType=Prs`, `POS=SCONJ\|PronType=Rel`, `POS=PRON\|Poss=Yes\|PronType=Prs`, `NumType=Card\|POS=DET`, `POS=NUM`, `POS=DET\|PronType=Prs`, `NumType=Card\|POS=ADJ`, `NumType=Ord\|POS=DET`, `POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=CCONJ\|PronType=Prs,Rel`, `Morph=VFin\|POS=ADP`, `POS=DET\|PronType=Int`, `POS=ADJ\|Tense=Pres\|VerbForm=Part`, `Morph=VFin\|POS=NOUN`, `POS=PRON\|Poss=Yes`, `POS=AUX`, `POS=ADV\|PronType=Rel`, `POS=PRON\|PronType=Rel`, `POS=SCONJ\|PronType=Prs`, `POS=ADP\|PronType=Prs,Rel`, `POS=NOUN\|VerbForm=Inf`, `Definite=Def\|POS=DET`, `POS=VERB\|Tense=Past`, `Definite=Ind\|POS=DET`, `POS=ADP\|PronType=Dem`, `POS=ADV\|PronType=Prs,Rel` | |
|
|
|
</details> |
|
|
|
### Accuracy |
|
|
|
| Type | Score | |
|
| --- | --- | |
|
| `DEP_UAS` | 75.18 | |
|
| `DEP_LAS` | 68.13 | |
|
| `SENTS_P` | 87.41 | |
|
| `SENTS_R` | 92.80 | |
|
| `SENTS_F` | 90.02 | |
|
| `LEMMA_ACC` | 84.43 | |
|
| `TAG_ACC` | 89.10 | |
|
| `POS_ACC` | 89.05 | |
|
| `MORPH_ACC` | 91.19 | |
|
| `TRANSFORMER_LOSS` | 130913.68 | |
|
| `PARSER_LOSS` | 16324.89 | |
|
| `TRAINABLE_LEMMATIZER_LOSS` | 904.27 | |
|
| `TAGGER_LOSS` | 4331.12 | |
|
| `MORPHOLOGIZER_LOSS` | 4719.16 | |
|
|
|
|
|
### Citation |
|
|
|
If you're using this model, please cite: |
|
|
|
``` |
|
@inproceedings{miranda-2024-allen, |
|
title = "{A}llen Institute for {AI} @ {SIGTYP} 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages", |
|
author = "Miranda, Lester James", |
|
booktitle = "Proceedings of the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP", |
|
month = mar, |
|
year = "2024", |
|
address = "St. Julian's, Malta", |
|
publisher = "Association for Computational Linguistics", |
|
url = "https://aclanthology.org/2024.sigtyp-1.18", |
|
pages = "151--159", |
|
} |
|
``` |