File size: 4,329 Bytes
5ef103f 2f2f493 5ef103f 653ea4f 64a4171 5ef103f b66b76d 5ef103f 15937bf b66b76d 5ef103f b66b76d 5ef103f b66b76d 5ef103f b66b76d 5ef103f b66b76d 5ef103f b66b76d 5ef103f b66b76d 15937bf b66b76d 5ef103f b66b76d 5ef103f b66b76d 5ef103f b66b76d 5ef103f b66b76d 5ef103f b66b76d |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 |
---
widget:
- text: Great New World Equity LLC
- text: Jerome J Glaser DBA International Curtain Call
- text: PICKERING INTERNATIONAL, INC.
- text: R & S BEVERAGE COMPANY, A CALIFORNIA LIMITED PARTNERSHIP
- text: Kehilla Community Synagogue and School
- text: Temarry Recycling, Inc.
- text: Jacobus De Groot Dairy
- text: United Brands Products Design Development and Marketing, Inc.
- text: GREEN VALLEY COMMUNITY CHURCH
- text: Sierra Central Credit Union
- text: Tri Counties Bank
- text: Lowkey Community LLC
model-index:
- name: Sociovestix/US-CA
results:
- task:
type: text-classification
name: Text Classification
dataset:
name: lenu
type: Sociovestix/lenu
config: US-CA
split: test
revision: f4d57b8d77a49ec5c62d899c9a213d23cd9f9428
metrics:
- type: f1
value: 0.954945054945055
name: f1
- type: f1
value: 0.5781293443200107
name: f1 macro
args:
average: macro
---
# LENU - Legal Entity Name Understanding for US California
A [BERT](https://huggingface.co/google-bert/bert-base-uncased) based model model fine-tuned on US California legal entity names (jurisdiction US-CA) from the Global [Legal Entity Identifier](https://www.gleif.org/en/about-lei/introducing-the-legal-entity-identifier-lei)
(LEI) System with the goal to detect [Entity Legal Form (ELF) Codes](https://www.gleif.org/en/about-lei/code-lists/iso-20275-entity-legal-forms-code-list).
---------------
<h1 align="center">
<a href="https://gleif.org">
<img src="http://sdglabs.ai/wp-content/uploads/2022/07/gleif-logo-new.png" width="220px" style="display: inherit">
</a>
</h1><br>
<h3 align="center">in collaboration with</h3>
<h1 align="center">
<a href="https://sociovestix.com">
<img src="https://sociovestix.com/img/svl_logo_centered.svg" width="700px" style="width: 100%">
</a>
</h1><br>
---------------
## Model Description
<!-- Provide a longer summary of what this model is. -->
The model has been created as part of a collaboration of the [Global Legal Entity Identifier Foundation](https://gleif.org) (GLEIF) and
[Sociovestix Labs](https://sociovestix.com) with the goal to explore how Machine Learning can support in detecting the ELF Code solely based on an entity's legal name and legal jurisdiction.
See also the open source python library [lenu](https://github.com/Sociovestix/lenu), which supports in this task.
The model has been trained on the dataset [lenu](https://huggingface.co/datasets/Sociovestix), with a focus on US California legal entities and ELF Codes within the Jurisdiction "US-CA".
- **Developed by:** [GLEIF](https://gleif.org) and [Sociovestix Labs](https://huggingface.co/Sociovestix)
- **License:** Creative Commons (CC0) license
- **Finetuned from model [optional]:** bert-base-uncased
- **Resources for more information:** [Press Release](https://www.gleif.org/en/newsroom/press-releases/machine-learning-new-open-source-tool-developed-by-gleif-and-sociovestix-labs-enables-organizations-everywhere-to-automatically-)
# Uses
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
An entity's legal form is a crucial component when verifying and screening organizational identity.
The wide variety of entity legal forms that exist within and between jurisdictions, however, has made it difficult for large organizations to capture legal form as structured data.
The Jurisdiction specific models of [lenu](https://github.com/Sociovestix/lenu), trained on entities from
GLEIF’s Legal Entity Identifier (LEI) database of over two million records, will allow banks,
investment firms, corporations, governments, and other large organizations to retrospectively analyze
their master data, extract the legal form from the unstructured text of the legal name and
uniformly apply an ELF code to each entity type, according to the ISO 20275 standard.
# Licensing Information
This model, which is trained on LEI data, is available under Creative Commons (CC0) license.
See [gleif.org/en/about/open-data](https://gleif.org/en/about/open-data).
# Recommendations
Users should always consider the score of the suggested ELF Codes. For low score values it may be necessary to manually review the affected entities. |