IreNkweke commited on
Commit
0954063
1 Parent(s): ab38512

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +58 -9
README.md CHANGED
@@ -16,9 +16,9 @@ model-index:
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
17
  should probably proofread and complete it, then remove this comment. -->
18
 
19
- # bert-finetuned-ner-ontonotes5
20
 
21
- This model is a fine-tuned version of [dslim/bert-base-NER](https://huggingface.co/dslim/bert-base-NER) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
  - Loss: 0.1634
24
  - Precision: 0.8620
@@ -26,19 +26,56 @@ It achieves the following results on the evaluation set:
26
  - F1: 0.8733
27
  - Accuracy: 0.9758
28
 
29
- ## Model description
30
-
31
- More information needed
32
-
33
  ## Intended uses & limitations
34
 
35
- More information needed
 
36
 
37
  ## Training and evaluation data
38
 
39
- More information needed
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
40
 
41
- ## Training procedure
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
42
 
43
  ### Training hyperparameters
44
 
@@ -68,3 +105,15 @@ The following hyperparameters were used during training:
68
  - Pytorch 2.3.0+cu121
69
  - Datasets 2.20.0
70
  - Tokenizers 0.19.1
 
 
 
 
 
 
 
 
 
 
 
 
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
17
  should probably proofread and complete it, then remove this comment. -->
18
 
19
+ # Model Description
20
 
21
+ This model is a fine-tuned version of [dslim/bert-base-NER](https://huggingface.co/dslim/bert-base-NER) on OntoNotes 5 dataset and is designed to identify and classify various types of entities in text, including persons, organizations, locations, dates, and more.
22
  It achieves the following results on the evaluation set:
23
  - Loss: 0.1634
24
  - Precision: 0.8620
 
26
  - F1: 0.8733
27
  - Accuracy: 0.9758
28
 
 
 
 
 
29
  ## Intended uses & limitations
30
 
31
+ The model is intended for use in applications requiring NER, such as information extraction, text classification, and enhancing search capabilities by identifying key entities within the text. It can be used to identify entities in any English text, including news articles, social media posts, and legal documents.
32
+
33
 
34
  ## Training and evaluation data
35
 
36
+ Training Data
37
+ The model was fine-tuned on the OntoNotes 5 dataset. This dataset includes multiple types of named entities and is widely used for NER tasks. The dataset is annotated with the following entity tags:
38
+
39
+ CARDINAL: Numerical values
40
+
41
+ DATE: References to dates and periods
42
+
43
+ PERSON: Names of people
44
+
45
+ NORP: Nationalities, religious groups, political groups
46
+
47
+ GPE: Countries, cities, states
48
+
49
+ LAW: Named documents and legal entities
50
+
51
+ ORG: Organizations
52
+
53
+ PERCENT: Percentage values
54
+
55
+ ORDINAL: Ordinal numbers
56
+
57
+ MONEY: Monetary values
58
+
59
+ WORK_OF_ART: Titles of creative works
60
 
61
+ FAC: Facilities
62
+
63
+ TIME: Times smaller than a day
64
+
65
+ LOC: Non-GPE locations, mountain ranges, bodies of water
66
+
67
+ QUANTITY: Measurements, as of weight or distance
68
+
69
+ PRODUCT: Objects, vehicles, foods, etc. (not services)
70
+
71
+ EVENT: Named events
72
+
73
+ LANGUAGE: Named languages
74
+
75
+ ## Model Configuration
76
+ Base Model: dslim/bert-base-NER
77
+
78
+ Number of Labels: 37 (including the "O" tag for outside any named entity)
79
 
80
  ### Training hyperparameters
81
 
 
105
  - Pytorch 2.3.0+cu121
106
  - Datasets 2.20.0
107
  - Tokenizers 0.19.1
108
+
109
+ ## Contact Information
110
+
111
+ For questions, comments, or issues with the model, please contact:
112
+
113
+ Name: [Irechukwu Nkweke]
114
+ Email: [[email protected]]
115
+ GitHub: [https://github.com/mnkweke]
116
+
117
+ ## Acknowledgments
118
+
119
+ This model was trained using the Hugging Face transformers library and the OntoNotes 5 dataset.