Update README.md
README.md
CHANGED
@@ -14,6 +14,11 @@ Disclaimer: The team releasing ViTMatte did not write a model card for this mode
 
 ViTMatte is a simple approach to image matting, the task of accurately estimating the foreground object in an image. The model consists of a Vision Transformer (ViT) with a lightweight head on top.
 
+<img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/vitmatte_architecture.png"
+alt="drawing" width="600"/>
+
+<small> ViTMatte high-level overview. Taken from the <a href="https://arxiv.org/abs/2305.15272">original paper.</a> </small>
+
 ## Intended uses & limitations
 
 You can use the raw model for image matting. See the [model hub](https://huggingface.co/models?search=vitmatte) to look for other