widget:
- text: HILÁRIO DA SILVA RIBEIRO, LIMITADA
- text: MARININVEST, S.A.
- text: ABERTO EUROVIDA REFORMA RENDIMENTO
- text: HCapital II Fundo de Capital de Risco Fechado
- text: ILUSTRE DESCOBERTA - UNIPESSOAL LDA
- text: FUNDAÇÃO CONDUCTUS
- text: ASSOCIAÇÃO DE SOCORROS DA FREGUESIA DA ENCARNAÇÃO - ASFE SAÚDE
- text: C.E.P.-COOPERATIVA DE ENSINO POLITÉCNICO CRL
- text: Província Portuguesa dos Sacerdotes do Coração de Jesus
- text: Newbridge, Lda
- text: CP - COMBOIOS DE PORTUGAL, EPE
- text: J. O. A. P. S. - CONFECÇÃO DE MALHAS, LDA
- text: PHC-SOFTWARE,S.A.
- text: Empathy Scenery - Management Lda
- text: >-
MULTISOMA-FORNECIMENTO, MONTAGEM E MANUTENÇÃO DE EQUIPAMENTOS LDA ,
SUCURSAL EM PORTUGAL
library_name: transformers
tags: []
model-index:
- name: Sociovestix/lenu_PT
results:
- task:
type: text-classification
name: Text Classification
dataset:
name: lenu
type: Sociovestix/lenu
config: PT
split: test
revision: 76da7696c49ebee8be7f521faa76ae99189bda34
metrics:
- type: f1
value: 0.9256578947368422
name: f1
- type: f1
value: 0.3851601817071645
name: f1 macro
args:
average: macro
LENU - Legal Entity Name Understanding for Portugal
A BERT multilingual based model model fine-tuned on Portuguese legal entity names (jurisdiction PT) from the Global Legal Entity Identifier (LEI) System with the goal to detect Entity Legal Form (ELF) Codes.
in collaboration with
Model Description
The model has been created as part of a collaboration of the Global Legal Entity Identifier Foundation (GLEIF) and Sociovestix Labs with the goal to explore how Machine Learning can support in detecting the ELF Code solely based on an entity's legal name and legal jurisdiction. See also the open source python library lenu, which supports in this task.
The model has been trained on the dataset lenu, with a focus on Portuguese legal entities and ELF Codes within the Jurisdiction "PT".
- Developed by: GLEIF and Sociovestix Labs
- License: Creative Commons (CC0) license
- Finetuned from model [optional]: bert-base-multilingual-uncased
- Resources for more information: Press Release
Uses
An entity's legal form is a crucial component when verifying and screening organizational identity. The wide variety of entity legal forms that exist within and between jurisdictions, however, has made it difficult for large organizations to capture legal form as structured data. The Jurisdiction specific models of lenu, trained on entities from GLEIF’s Legal Entity Identifier (LEI) database of over two million records, will allow banks, investment firms, corporations, governments, and other large organizations to retrospectively analyze their master data, extract the legal form from the unstructured text of the legal name and uniformly apply an ELF code to each entity type, according to the ISO 20275 standard.
Licensing Information
This model, which is trained on LEI data, is available under Creative Commons (CC0) license. See gleif.org/en/about/open-data.
Recommendations
Users should always consider the score of the suggested ELF Codes. For low score values it may be necessary to manually review the affected entities.