Update README.md
Browse files
README.md
CHANGED
@@ -7,8 +7,8 @@ tags:
|
|
7 |
# Model Card for SlimSAM (compressed version of SAM = Segment Anything)
|
8 |
|
9 |
<p>
|
10 |
-
<img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/
|
11 |
-
<em>
|
12 |
</p>
|
13 |
|
14 |
# Table of Contents
|
@@ -20,14 +20,13 @@ tags:
|
|
20 |
|
21 |
# TL;DR
|
22 |
|
23 |
-
[
|
24 |
-
| <img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/sam-beancans.png" alt="Snow" width="600" height="600"> | <img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/sam-dog-masks.png" alt="Forest" width="600" height="600"> | <img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/sam-car-seg.png" alt="Mountains" width="600" height="600"> |
|
25 |
-
|---------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------|
|
26 |
|
27 |
-
The **Segment Anything Model (SAM)** produces high quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image. It has been trained on a [dataset](https://segment-anything.com/dataset/index.html) of 11 million images and 1.1 billion masks, and has strong zero-shot performance on a variety of segmentation tasks.
|
28 |
The abstract of the paper states:
|
29 |
|
30 |
-
>
|
|
|
|
|
31 |
|
32 |
**Disclaimer**: Content from **this** model card has been written by the Hugging Face team, and parts of it were copy pasted from the original [SAM model card](https://github.com/facebookresearch/segment-anything).
|
33 |
|
|
|
7 |
# Model Card for SlimSAM (compressed version of SAM = Segment Anything)
|
8 |
|
9 |
<p>
|
10 |
+
<img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/slimsam_overview.png" alt="Model architecture">
|
11 |
+
<em> Overview of SlimSAM and its differences to alternatives.</em>
|
12 |
</p>
|
13 |
|
14 |
# Table of Contents
|
|
|
20 |
|
21 |
# TL;DR
|
22 |
|
23 |
+
SlimSAM is a compressed (pruned) version of the [Segment Anything (SAM)](https://huggingface.co/docs/transformers/model_doc/sam) model, capabling of producing high quality object masks from input prompts such as points or boxes.
|
|
|
|
|
24 |
|
|
|
25 |
The abstract of the paper states:
|
26 |
|
27 |
+
> The formidable model size and demanding computational requirements of Segment Anything Model (SAM) have rendered it cumbersome for deployment on resource-constrained devices. Existing approaches for SAM compression typically involve training a new network from scratch, posing a challenging trade-off between compression costs and model performance. To address this issue, this paper introduces SlimSAM, a novel SAM compression method that achieves superior performance with remarkably low training costs. This is achieved by the efficient reuse of pre-trained SAMs through a unified pruning-distillation framework. To enhance knowledge inheritance from the original SAM, we employ an innovative alternate slimming strategy that partitions the compression process into a progressive procedure. Diverging from prior pruning techniques, we meticulously prune and distill decoupled model structures in an alternating fashion. Furthermore, a novel label-free pruning criterion is also proposed to align the pruning objective with the optimization target, thereby boosting the post-distillation after pruning. SlimSAM yields significant performance improvements while demanding over 10 times less training costs than any other existing methods. Even when compared to the original SAM-H, SlimSAM achieves approaching performance while reducing parameter counts to merely 0.9% (5.7M), MACs to 0.8% (21G), and requiring only 0.1% (10k) of the SAM training data.
|
28 |
+
|
29 |
+
[Link to original repository](https://github.com/czg1225/SlimSAM)
|
30 |
|
31 |
**Disclaimer**: Content from **this** model card has been written by the Hugging Face team, and parts of it were copy pasted from the original [SAM model card](https://github.com/facebookresearch/segment-anything).
|
32 |
|