metadata
license: apache-2.0
language:
- ca
- da
- de
- en
- es
- fr
- nl
- el
- is
- it
- 'no'
- pt
- sv
pipeline_tag: text-classification
Occupational CANINE: HISCO Classification Model
Overview
OccCANINE is a fine-tuned version of CANINE that automatically converts occupational descriptions into standardized HISCO codes. It supports historical occupational data analysis, with over 90% accuracy across 13 languages.
See more on GitHub: https://github.com/christianvedels/OccCANINE
Read the paper on arXiv: https://arxiv.org/abs/2402.13604
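CANINE operates directly on characters rather than on a word or subword vocabulary, which is helpful for historical descriptions with irregular spelling. The sketch below illustrates only that idea, using plain Python to map a description to Unicode code points. The function name to_codepoints is hypothetical and is not part of OccCANINE; the real tokenizer also adds special tokens and padding, which are omitted here.

```python
def to_codepoints(description: str) -> list[int]:
    """Map each character of a description to its Unicode code point.

    Illustrative only: this mimics the character-level view of the input,
    not the full preprocessing done by the actual model.
    """
    return [ord(ch) for ch in description]


if __name__ == "__main__":
    # Any character has a code point, so there is no out-of-vocabulary case.
    for text in ["smed", "bødker"]:
        print(text, to_codepoints(text))
```

Because every character maps to a code point, unusual spellings and non-ASCII letters need no special handling at this stage.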
Key Features
- High Accuracy: Over 90% accuracy, recall, and precision.
- Multilingual Support: Trained on 14 million description-HISCO code pairs across 13 languages.
- Efficiency: Rapidly processes descriptions into HISCO codes.
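To make the accuracy, recall, and precision figures above concrete, here is a minimal sketch of how such metrics could be computed when a description may carry more than one code. It is an assumption about one reasonable definition (per-description set comparison, then averaging), not necessarily the procedure used in the paper. The function names and the code values in the example are hypothetical placeholders.

```python
def observation_metrics(true_codes, predicted_codes):
    """Return (exact_match, precision, recall) for one description."""
    true_set, pred_set = set(true_codes), set(predicted_codes)
    overlap = len(true_set & pred_set)
    precision = overlap / len(pred_set) if pred_set else 0.0
    recall = overlap / len(true_set) if true_set else 0.0
    exact_match = 1.0 if true_set == pred_set else 0.0
    return exact_match, precision, recall


def average_metrics(pairs):
    """Average the per-description metrics over (true, predicted) pairs."""
    if not pairs:
        raise ValueError("pairs must not be empty")
    totals = [0.0, 0.0, 0.0]
    for true_codes, predicted_codes in pairs:
        scores = observation_metrics(true_codes, predicted_codes)
        for i, value in enumerate(scores):
            totals[i] += value
    n = len(pairs)
    return {
        "accuracy": totals[0] / n,
        "precision": totals[1] / n,
        "recall": totals[2] / n,
    }


if __name__ == "__main__":
    # Placeholder code strings for illustration; not real evaluation data.
    example_pairs = [
        (["61110"], ["61110"]),           # exact match
        (["61110", "79100"], ["61110"]),  # one true code missed
        (["79100"], ["61110"]),           # wrong code
        (["54010"], ["54010", "79100"]),  # one extra code predicted
    ]
    print(average_metrics(example_pairs))
```

Under this definition, accuracy counts only descriptions whose full set of codes is predicted exactly, while precision and recall give partial credit for overlapping codes.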
Contribution and Support
Developed at the University of Southern Denmark by Christian Møller Dahl, Torben Johansen and Christian Vedel with contributions from various sources.