OccCANINE / README.md
Christianvedel's picture
Update README.md
9d13433 verified
metadata
license: apache-2.0
language:
  - ca
  - da
  - de
  - en
  - es
  - fr
  - nl
  - el
  - is
  - it
  - 'no'
  - pt
  - sv
pipeline_tag: text-classification

Occupational CANINE: HISCO Classification Model

Overview

OccCANINE is a version of CANINE which has been finetuned to automatically convert occupational descriptions into standardized HISCO codes using a CANINE model. This tool facilitates historical occupational data analysis with over 90% accuracy across 13 languages.

See more on: GitHub.com/christianvedels/OccCANINE

Read the paper on arXiv: https://arxiv.org/abs/2402.13604

Key Features

  • High Accuracy: Over 90% accuracy, recall, and precision.
  • Multilingual Support: Trained on 14 million description-HISCO code pairs across 13 languages.
  • Efficiency: Rapidly processes descriptions into HISCO codes.

Contribution and Support

Developed at the University of Southern Denmark by Christian Møller Dahl, Torben Johansen and Christian Vedel with contributions from various sources.