0-ma/vit-geometric-shapes-base

Image Classification

0-ma committed on Sep 12

Commit 858eefd • 1 Parent(s): 9d2be22

Update README.md

Files changed (1)

README.md +47 -3

@@ -1,3 +1,47 @@
----
-license: apache-2.0
----

+---
+base_model: google/vit-base-patch16-224
+datasets:
+- 0-ma/geometric-shapes
+license: apache-2.0
+metrics:
+- accuracy
+pipeline_tag: image-classification
+---
+# Model Card for ViT Geometric Shapes Dataset Base
+## Training Dataset
+- **Repository:** https://huggingface.co/datasets/0-ma/geometric-shapes
+## Base Model
+- **Repository:** https://huggingface.co/google/vit-base-patch16-224
+## Accuracy
+- Accuracy on the `test` split of 0-ma/geometric-shapes: 0.9269047619047619
+# Loading and using the model
+    import numpy as np
+    import requests
+    import torch
+    from PIL import Image
+    from transformers import AutoImageProcessor, AutoModelForImageClassification
+    # Class names, indexed in the same order as the model's output logits
+    labels = [
+        "Only text",
+        "Circle",
+        "Triangle",
+        "Square",
+        "Pentagon",
+        "Hexagon"
+    ]
+    # Download two example images (a circle and a pentagon)
+    urls = [
+        "https://raw.githubusercontent.com/0-ma/geometric-shape-detector/main/input/exemple_circle.jpg",
+        "https://raw.githubusercontent.com/0-ma/geometric-shape-detector/main/input/exemple_pentagone.jpg",
+    ]
+    images = [Image.open(requests.get(url, stream=True).raw) for url in urls]
+    # Load the image processor and the fine-tuned classifier
+    feature_extractor = AutoImageProcessor.from_pretrained('0-ma/vit-geometric-shapes-base')
+    model = AutoModelForImageClassification.from_pretrained('0-ma/vit-geometric-shapes-base')
+    # Preprocess the batch and run inference without tracking gradients
+    inputs = feature_extractor(images=images, return_tensors="pt")
+    with torch.no_grad():
+        logits = model(**inputs).logits.cpu().numpy()
+    # Pick the highest-scoring class for each image
+    predictions = np.argmax(logits, axis=1)
+    predicted_labels = [labels[prediction] for prediction in predictions]
+    print(predicted_labels)
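
The README snippet above reports only the top label per image. As a rough sketch that is not part of the commit, the same `logits` array could also be turned into a confidence score with a softmax. The helper name `logits_to_predictions` and the example logits below are hypothetical; real values would come from the model call in the README.

```python
import numpy as np

# Label order copied from the README snippet above.
LABELS = ["Only text", "Circle", "Triangle", "Square", "Pentagon", "Hexagon"]


def logits_to_predictions(logits):
    """Return one (label, probability) pair per row of a (batch, 6) logits array."""
    logits = np.asarray(logits, dtype=np.float64)
    # Subtract the row-wise max before exponentiating for numerical stability.
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    probs = exp / exp.sum(axis=1, keepdims=True)
    best = probs.argmax(axis=1)
    return [(LABELS[i], float(probs[row, i])) for row, i in enumerate(best)]


# Made-up logits for illustration only; real values come from the model.
example_logits = [
    [0.1, 4.0, 0.3, 0.2, 0.0, -1.0],
    [0.0, 0.2, 0.1, 0.3, 3.5, 1.0],
]
results = logits_to_predictions(example_logits)
for label, prob in results:
    print(f"{label}: {prob:.3f}")
```

With the README code, passing its `logits` array to this helper in place of `example_logits` would give one label and probability per input image.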