|
--- |
|
license: apache-2.0 |
|
language: |
|
- ca |
|
- da |
|
- de |
|
- en |
|
- es |
|
- fr |
|
- nl |
|
- el |
|
- is |
|
- it |
|
- 'no' |
|
- pt |
|
- sv |
|
pipeline_tag: text-classification |
|
--- |
|
# Occupational CANINE: HISCO Classification Model |
|
|
|
## Overview |
|
OccCANINE is a version of [CANINE](https://huggingface.co/google/canine-s) which has been finetuned to automatically convert occupational descriptions into standardized HISCO codes using a CANINE model. This tool facilitates historical occupational data analysis with over 90% accuracy across 13 languages. |
|
|
|
See more on: [GitHub.com/christianvedels/OccCANINE](https://github.com/christianvedels/OccCANINE) |
|
|
|
Read the paper on arXiv: [https://arxiv.org/abs/2402.13604](https://arxiv.org/abs/2402.13604) |
|
|
|
## Key Features |
|
- **High Accuracy**: Over 90% accuracy, recall, and precision. |
|
- **Multilingual Support**: Trained on 14 million description-HISCO code pairs across 13 languages. |
|
- **Efficiency**: Rapidly processes descriptions into HISCO codes. |
|
|
|
## Contribution and Support |
|
Developed at the University of Southern Denmark by Christian Møller Dahl, Torben Johansen and Christian Vedel with contributions from various sources. |
|
|