hustvl
/

vitmatte-small-composition-1k

Inference Endpoints

Model card Files Files and versions Community

nielsr HF staff commited on Sep 21, 2023

Commit

03f5646

•

1 Parent(s): f331a72

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -14,6 +14,11 @@ Disclaimer: The team releasing ViTMatte did not write a model card for this mode
 ViTMatte is a simple approach to image matting, the task of accurately estimating the foreground object in an image. The model consists of a Vision Transformer (ViT) with a lightweight head on top.
 ## Intended uses & limitations
 You can use the raw model for image matting. See the [model hub](https://huggingface.co/models?search=vitmatte) to look for other

 ViTMatte is a simple approach to image matting, the task of accurately estimating the foreground object in an image. The model consists of a Vision Transformer (ViT) with a lightweight head on top.
+<img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/vitmatte_architecture.png"
+alt="drawing" width="600"/>
+<small> ViTMatte high-level overview. Taken from the <a href="https://arxiv.org/abs/2305.15272">original paper.</a> </small>
 ## Intended uses & limitations
 You can use the raw model for image matting. See the [model hub](https://huggingface.co/models?search=vitmatte) to look for other