Pclanglais's picture
Create README.md
a1bceaf verified

French-TV-transcript-NER is a named-entity recognition model trained specifically on French TV headlines and transcript.

Given the format specificities, generalist multilingual or French model were unperforming. Additionally, the new model also provide additional set of entities useful in production (such as distinction between first name and last name).

Entities

The model covers twelve entities:

  • First name (prenom)
  • Last name (nom)
  • Location (lieu)
  • Country (pays)
  • Organization (organisation)
  • Event (evenement)
  • Nationality (nationalite)
  • Broadcast name (emission)
  • Product (produit), such as technological production, medicine, etc.
  • Law (loi)
  • Cultural creation (creation), such as movie titles, novels, etc.
  • Disease (maladie)