title: Segment Anything
emoji: 🚀
colorFrom: gray
colorTo: pink
sdk: gradio
sdk_version: 3.24.1
app_file: app.py
pinned: false

Segment Anything WebUI

This project is based on the Segment Anything Model (SAM) by Meta. The UI is built with Gradio.

Try the demo on HF: AIBoy1993/segment_anything_webui
GitHub

Change Logs

[2023-4-11]
- Support video segmentation. A short video can be automatically segmented by SAM.
- Support text prompt segmentation using the OWL-ViT (Vision Transformer for Open-World Localization) model.
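
One common way to combine the two models is to let OWL-ViT propose boxes for the text prompt and then pass each box to SAM as a box prompt. The sketch below illustrates that flow under stated assumptions: it uses the Hugging Face transformers OWL-ViT classes and the segment_anything SamPredictor, the helper names clip_box and segment_with_text are hypothetical, and the post-processing method name may differ between transformers versions. It is not necessarily how app.py implements the feature.

```python
def clip_box(box, width, height):
    """Clamp an (x0, y0, x1, y1) box to the image bounds.

    Detector boxes can extend slightly past the image edges.
    """
    x0, y0, x1, y1 = box
    return [
        float(min(max(x0, 0), width)),
        float(min(max(y0, 0), height)),
        float(min(max(x1, 0), width)),
        float(min(max(y1, 0), height)),
    ]


def segment_with_text(image, text, sam_checkpoint, threshold=0.1):
    """Return one boolean mask per OWL-ViT detection for `text`.

    `image` is a PIL image. Heavy dependencies are imported lazily so the
    helper above stays usable without them.
    """
    import numpy as np
    import torch
    from transformers import OwlViTProcessor, OwlViTForObjectDetection
    from segment_anything import sam_model_registry, SamPredictor

    # Stage 1: text-conditioned detection with OWL-ViT.
    name = "google/owlvit-base-patch32"
    processor = OwlViTProcessor.from_pretrained(name)
    detector = OwlViTForObjectDetection.from_pretrained(name)
    inputs = processor(text=[[text]], images=image, return_tensors="pt")
    with torch.no_grad():
        outputs = detector(**inputs)
    width, height = image.size
    target_sizes = torch.tensor([[height, width]])
    # Assumption: this method name matches your installed transformers version.
    result = processor.post_process_object_detection(
        outputs=outputs, threshold=threshold, target_sizes=target_sizes
    )[0]

    # Stage 2: use each detected box as a SAM box prompt.
    sam = sam_model_registry["vit_b"](checkpoint=sam_checkpoint)
    predictor = SamPredictor(sam)
    predictor.set_image(np.array(image.convert("RGB")))
    masks = []
    for box in result["boxes"].tolist():
        box_xyxy = np.array(clip_box(box, width, height))
        mask, _, _ = predictor.predict(box=box_xyxy, multimask_output=False)
        masks.append(mask[0])
    return masks
```

A lower threshold yields more candidate boxes (and more masks) at the cost of more false positives.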

Usage

The following steps run the app locally on your computer.

Install Segment Anything (see the Segment Anything installation instructions for more details):

pip install git+https://github.com/facebookresearch/segment-anything.git

Clone this repository:

git clone https://github.com/5663015/segment_anything_webui.git

Create a folder named checkpoints in the project root and put the downloaded weight files in it. The weights can be downloaded from the following URLs:
- vit_h: ViT-H SAM model
- vit_l: ViT-L SAM model
- vit_b: ViT-B SAM model
Under checkpoints, create a folder named models--google--owlvit-base-patch32 and put the downloaded OWL-ViT weight files in it.
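
The folder layout described above can also be created with a few lines of Python. This is only a convenience sketch: the function name make_checkpoint_dirs is hypothetical, and the weight files still have to be downloaded and copied in by hand.

```python
from pathlib import Path


def make_checkpoint_dirs(project_root="."):
    """Create checkpoints/ and the OWL-ViT subfolder under the project root."""
    checkpoints = Path(project_root) / "checkpoints"
    owlvit = checkpoints / "models--google--owlvit-base-patch32"
    # parents=True also creates checkpoints/; exist_ok=True makes reruns safe.
    owlvit.mkdir(parents=True, exist_ok=True)
    return checkpoints, owlvit


if __name__ == "__main__":
    checkpoints, owlvit = make_checkpoint_dirs()
    print(f"Put the SAM weights in: {checkpoints}")
    print(f"Put the OWL-ViT weights in: {owlvit}")
```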
Run:

python app.py

Note: The default model is vit_b and the default device is cpu, so the demo can run without a GPU.
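
For readers who want to load a different model or device outside the UI, a minimal sketch using the segment_anything package is shown below. It assumes the weights keep the file names of the official SAM release and live in the checkpoints folder; the helper names checkpoint_path and load_sam are hypothetical, and app.py may resolve its paths differently.

```python
from pathlib import Path

# File names as published with the official SAM release (assumed unchanged).
SAM_CHECKPOINTS = {
    "vit_h": "sam_vit_h_4b8939.pth",
    "vit_l": "sam_vit_l_0b3195.pth",
    "vit_b": "sam_vit_b_01ec64.pth",
}


def checkpoint_path(model_type="vit_b", folder="checkpoints"):
    """Return the expected weight file path for a SAM model type."""
    if model_type not in SAM_CHECKPOINTS:
        raise ValueError(f"unknown model type: {model_type!r}")
    return Path(folder) / SAM_CHECKPOINTS[model_type]


def load_sam(model_type="vit_b", device="cpu"):
    """Build a SAM model from local weights and move it to `device`."""
    # Imported lazily so checkpoint_path works without the package installed.
    from segment_anything import sam_model_registry

    sam = sam_model_registry[model_type](
        checkpoint=str(checkpoint_path(model_type))
    )
    sam.to(device=device)
    return sam
```

Calling `load_sam("vit_h", device="cuda")` would select the largest model on a GPU, while the defaults mirror the CPU setup described in the note.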

TODO

Video segmentation
Add text prompt
Add segmentation prompt (point and box)

Reference

Thanks to the wonderful work of Segment Anything and OWL-ViT.
Some of the video processing code draws on kadirnar/segment-anything-video, and some of the OWL-ViT code draws on ngthanhtin/owlvit_segment_anything.