|
--- |
|
base_model: |
|
- dslim/bert-base-NER |
|
pipeline_tag: token-classification |
|
tags: |
|
- token-classification |
|
- pytorch |
|
- transformers |
|
- named-entity-recognition |
|
metrics: |
|
- seqeval |
|
--- |
|
|
|
# bert-base-mountain-NER |
|
|
|
This model is a specialized adaptation of [dslim/bert-base-NER](https://huggingface.co/dslim/bert-base-NER), tailored for recognizing mountain names with a focus on geographical texts. Unlike the original, this model retains all 12 hidden layers and has been specifically fine-tuned to achieve high precision in identifying mountain-related entities across diverse texts. |
|
|
|
It is ideal for applications that involve extracting geographic information from travel literature, research documents, or any content related to natural landscapes. |
|
|
|
## Dataset |
|
|
|
The model was trained using approximately 115 samples generated specifically for mountain name recognition. These samples were created with the assistance of ChatGPT, focusing on realistic use cases for mountain-related content in the NER format. |
|
|
|
## How to Use |
|
|
|
You can easily integrate this model with the Transformers library's NER pipeline: |
|
|
|
```python |
|
import torch |
|
from transformers import AutoTokenizer, AutoModelForTokenClassification |
|
from transformers import pipeline |
|
|
|
device = "cuda" if torch.cuda.is_available() else "cpu" |
|
|
|
# Load model and tokenizer |
|
model_name = "Lizrek/bert-base-mountain-NER" |
|
tokenizer = AutoTokenizer.from_pretrained(model_name) |
|
model = AutoModelForTokenClassification.from_pretrained(model_name) |
|
|
|
# Create a pipeline for NER |
|
nlp = pipeline("ner", model=model, tokenizer=tokenizer) |
|
|
|
# Example usage |
|
example = "Mount Fuji in Japan are example of volcanic mountain.." |
|
ner_results = nlp(example) |
|
print(ner_results) |
|
``` |
|
|
|
## Example Output |
|
|
|
For the above input, the model provides the following output: |
|
|
|
```python |
|
[{'entity': 'B-MOUNTAIN_NAME', 'score': np.float32(0.9827131), 'index': 1, 'word': 'Mount', 'start': 0, 'end': 5}, {'entity': 'I-MOUNTAIN_NAME', 'score': np.float32(0.98952174), 'index': 2, 'word': 'Fuji', 'start': 6, 'end': 10}] |
|
``` |
|
|
|
This output highlights recognized mountain names, providing metadata such as entity type, confidence score, and word position. |
|
|
|
## Limitations |
|
|
|
- The model is specialized for mountain names and may not be effective in recognizing other types of geographical entities such as rivers or lakes. |
|
- If the input text is significantly different from the training data in style or terminology, accuracy may be affected. |