Add model card
Browse files
README.md
CHANGED
@@ -7,9 +7,9 @@ datasets:
|
|
7 |
- imagenet-21k
|
8 |
---
|
9 |
|
10 |
-
# BEiT (large-sized model,
|
11 |
|
12 |
-
BEiT model pre-trained in a self-supervised fashion on ImageNet-22k - also called ImageNet-21k (14 million images, 21,841 classes) at resolution 224x224
|
13 |
|
14 |
Disclaimer: The team releasing BEiT did not write a model card for this model so this model card has been written by the Hugging Face team.
|
15 |
|
@@ -32,26 +32,26 @@ fine-tuned versions on a task that interests you.
|
|
32 |
Here is how to use this model to classify an image of the COCO 2017 dataset into one of the 1,000 ImageNet classes:
|
33 |
|
34 |
```python
|
35 |
-
from transformers import
|
36 |
from PIL import Image
|
37 |
import requests
|
|
|
38 |
url = 'http://images.cocodataset.org/val2017/000000039769.jpg'
|
39 |
image = Image.open(requests.get(url, stream=True).raw)
|
40 |
-
|
41 |
-
|
|
|
|
|
42 |
inputs = feature_extractor(images=image, return_tensors="pt")
|
43 |
outputs = model(**inputs)
|
44 |
logits = outputs.logits
|
45 |
-
# model predicts one of the 21,841 ImageNet-22k classes
|
46 |
-
predicted_class_idx = logits.argmax(-1).item()
|
47 |
-
print("Predicted class:", model.config.id2label[predicted_class_idx])
|
48 |
```
|
49 |
|
50 |
Currently, both the feature extractor and model support PyTorch.
|
51 |
|
52 |
## Training data
|
53 |
|
54 |
-
The BEiT model was pretrained on [ImageNet-21k](http://www.image-net.org/), a dataset consisting of 14 million images and 21k classes
|
55 |
|
56 |
## Training procedure
|
57 |
|
|
|
7 |
- imagenet-21k
|
8 |
---
|
9 |
|
10 |
+
# BEiT (large-sized model, pre-trained only)
|
11 |
|
12 |
+
BEiT model pre-trained in a self-supervised fashion on ImageNet-22k - also called ImageNet-21k (14 million images, 21,841 classes) at resolution 224x224. It was introduced in the paper [BEIT: BERT Pre-Training of Image Transformers](https://arxiv.org/abs/2106.08254) by Hangbo Bao, Li Dong and Furu Wei and first released in [this repository](https://github.com/microsoft/unilm/tree/master/beit).
|
13 |
|
14 |
Disclaimer: The team releasing BEiT did not write a model card for this model so this model card has been written by the Hugging Face team.
|
15 |
|
|
|
32 |
Here is how to use this model to classify an image of the COCO 2017 dataset into one of the 1,000 ImageNet classes:
|
33 |
|
34 |
```python
|
35 |
+
from transformers import BeitFeatureExtractor, BeitForMaskedImageModeling
|
36 |
from PIL import Image
|
37 |
import requests
|
38 |
+
|
39 |
url = 'http://images.cocodataset.org/val2017/000000039769.jpg'
|
40 |
image = Image.open(requests.get(url, stream=True).raw)
|
41 |
+
|
42 |
+
feature_extractor = BeitFeatureExtractor.from_pretrained('microsoft/beit-large-patch16-224-pt22k')
|
43 |
+
model = BeitForMaskedImageModeling.from_pretrained('microsoft/beit-large-patch16-224-pt22k')
|
44 |
+
|
45 |
inputs = feature_extractor(images=image, return_tensors="pt")
|
46 |
outputs = model(**inputs)
|
47 |
logits = outputs.logits
|
|
|
|
|
|
|
48 |
```
|
49 |
|
50 |
Currently, both the feature extractor and model support PyTorch.
|
51 |
|
52 |
## Training data
|
53 |
|
54 |
+
The BEiT model was pretrained on [ImageNet-21k](http://www.image-net.org/), a dataset consisting of 14 million images and 21k classes.
|
55 |
|
56 |
## Training procedure
|
57 |
|