Model Specification
- Model: XLM-RoBERTa (base-sized model)
- Training Data:
- Combined Afrikaans, Hebrew, Bulgarian, Vietnamese, Norwegian, Urdu, Czech, Persian, Faroese, and English corpora (top 10 languages)
- Training Details:
- Base configuration with a single change: the learning rate set to 4.5e-5 (see the fine-tuning sketch after this list)
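
Below is a minimal fine-tuning sketch of the setup described above: the default xlm-roberta-base configuration with only the learning rate changed to 4.5e-5. The `train_dataset` variable stands in for the combined ten-language UD corpus (tokenized and label-aligned) and is not defined here; everything else uses the standard Hugging Face transformers API.

```python
from transformers import (
    AutoModelForTokenClassification,
    AutoTokenizer,
    TrainingArguments,
)

# The 14 UPOS labels this model predicts (see "POS Tags" below).
UPOS_TAGS = ["ADJ", "ADP", "ADV", "CCONJ", "DET", "INTJ", "NOUN",
             "NUM", "PART", "PRON", "PROPN", "PUNCT", "SCONJ", "VERB"]

tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")
model = AutoModelForTokenClassification.from_pretrained(
    "xlm-roberta-base",
    num_labels=len(UPOS_TAGS),
    id2label=dict(enumerate(UPOS_TAGS)),
    label2id={t: i for i, t in enumerate(UPOS_TAGS)},
)

args = TrainingArguments(
    output_dir="xlmr-base-upos",
    learning_rate=4.5e-5,  # the single deviation from the base defaults
)

# Training itself would use the standard Trainer loop, e.g.:
# from transformers import Trainer
# trainer = Trainer(model=model, args=args, train_dataset=train_dataset)
# trainer.train()
```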
Evaluation
- Evaluation Dataset: Universal Dependencies Tagalog Ugnayan (Testing Set)
- Evaluated zero-shot (cross-lingual transfer: no Tagalog data was seen during training), achieving 77.90% accuracy; an inference sketch follows below
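
A minimal inference sketch using the transformers token-classification pipeline. `MODEL_ID` is a placeholder for this model's Hub identifier, not a real repository name; substitute the actual one.

```python
from transformers import pipeline

# Load the fine-tuned tagger; MODEL_ID is a placeholder, not a real ID.
tagger = pipeline(
    "token-classification",
    model="MODEL_ID",
    aggregation_strategy="simple",  # merge sub-word pieces into words
)

# Tagalog: "The child is reading a book."
print(tagger("Nagbabasa ang bata ng libro."))
# Each entry carries the predicted UPOS tag, score, and character span.
```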
POS Tags
- ADJ, ADP, ADV, CCONJ, DET, INTJ, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, VERB