File size: 1,651 Bytes
dd3bfbd
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d1c6738
dd3bfbd
 
 
 
 
 
 
724ecf1
dd3bfbd
 
 
 
 
 
 
 
 
 
 
 
 
 
2f3e24c
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
---
base_model: nvidia/mit-b0
datasets:
- 0-ma/geometric-shapes
license: other
metrics:
- accuracy
pipeline_tag: image-classification
---
# Model Card for Mit-B0 Geometric Shapes Dataset

## Training Dataset

- **Repository:** https://huggingface.co/datasets/0-ma/geometric-shapes

## Base Model

- **Repository:** https://huggingface.co/models/nvidia/mit-b0

## Accuracy

 - Accuracy on dataset 0-ma/geometric-shapes [test] : 0.8628571428571429

# Loading and using the model
    import numpy as np
    from PIL import Image
    from transformers import AutoImageProcessor, AutoModelForImageClassification 
    import requests
    labels =  [
        "None",
        "Circle",
        "Triangle",
        "Square",
        "Pentagon",
        "Hexagon"
    ] 
    images = [Image.open(requests.get("https://raw.githubusercontent.com/0-ma/geometric-shape-detector/main/input/exemple_circle.jpg", stream=True).raw), 
            Image.open(requests.get("https://raw.githubusercontent.com/0-ma/geometric-shape-detector/main/input/exemple_pentagone.jpg", stream=True).raw)]
    feature_extractor = AutoImageProcessor.from_pretrained('0-ma/mit-b0-geometric-shapes')
    model = AutoModelForImageClassification.from_pretrained('0-ma/mit-b0-geometric-shapes')
    inputs = feature_extractor(images=images, return_tensors="pt")
    logits = model(**inputs)['logits'].cpu().detach().numpy()
    predictions = np.argmax(logits, axis=1)    
    predicted_labels = [labels[prediction] for prediction in predictions]
    print(predicted_labels)

### License

The license for this model can be found [here](https://github.com/NVlabs/SegFormer/blob/master/LICENSE).