|
--- |
|
license: apache-2.0 |
|
datasets: |
|
- AyoubChLin/CNN_News_Articles_2011-2022 |
|
language: |
|
- en |
|
metrics: |
|
- accuracy |
|
pipeline_tag: zero-shot-classification |
|
--- |
|
|
|
# DistilBERT for Zero Shot Classification |
|
|
|
This repository contains a DistilBERT model trained for zero-shot classification on CNN articles. The model has been evaluated on CNN articles and achieved an accuracy of 0.956 and an F1 score of 0.955. |
|
|
|
## Model Details |
|
- Architecture: DistilBERT |
|
- Training Data: CNN articles |
|
- Accuracy: 0.956 |
|
- F1 Score: 0.955 |
|
|
|
## Usage |
|
|
|
To use this model for zero-shot classification, you can follow the steps below: |
|
|
|
|
|
|
|
1. Load the trained model: |
|
```python |
|
|
|
from transformers import AutoTokenizer, AutoModelForSequenceClassification |
|
|
|
tokenizer = AutoTokenizer.from_pretrained("AyoubChLin/DistilBERT_ZeroShot") |
|
|
|
model = AutoModelForSequenceClassification.from_pretrained("AyoubChLin/DistilBERT_ZeroShot") |
|
|
|
``` |
|
|
|
4. Classify text using zero-shot classification: |
|
|
|
```python |
|
|
|
|
|
from transformers import pipeline |
|
|
|
# Create a zero-shot classification pipeline |
|
classifier = pipeline("zero-shot-classification", model=model, tokenizer=tokenizer) |
|
|
|
# Classify a sentence |
|
sentence = "The latest scientific breakthroughs in medicine" |
|
candidate_labels = ["politics", "sports", "technology", "business"] |
|
|
|
result = classifier(sentence, candidate_labels) |
|
|
|
print(result) |
|
|
|
``` |
|
|
|
The output will be a dictionary containing the classified label and the corresponding classification score. |
|
|
|
## About the Author |
|
|
|
This work was created by Ayoub Cherguelaine. |
|
|
|
If you have any questions or suggestions regarding this repository or the trained model, feel free to reach out to Ayoub Cherguelaine. |