Spaces:
Build error
Build error
A newer version of the Gradio SDK is available:
5.6.0
metadata
title: Segment Anything
emoji: 🚀
colorFrom: gray
colorTo: pink
sdk: gradio
sdk_version: 3.25.0
app_file: app.py
pinned: false
Segment Anything WebUI
This project is based on Segment Anything Model by Meta. The UI is based on Gradio.
- Try deme on HF: AIBoy1993/segment_anything_webui
- GitHub
Change Logs
- [2023-4-11]
- Support video segmentation. A short video can be automatically segmented by SAM.
- Support text prompt segmentation using OWL-ViT (Vision Transformer for Open-World Localization) model.
Usage
Following usage is running on your computer.
- Install Segment Anything(more details about install Segment Anything):
pip install git+https://github.com/facebookresearch/segment-anything.git
git clone
this repository:
git clone https://github.com/5663015/segment_anything_webui.git
Make a new folder named
checkpoints
under this project,and put the downloaded weights files incheckpoints
。You can download the weights using following URLs:vit_h
: ViT-H SAM modelvit_l
: ViT-L SAM modelvit_b
: ViT-B SAM model
Under
checkpoints
, make a new folder namedmodels--google--owlvit-base-patch32
, and put the downloaded OWL-ViT weights files inmodels--google--owlvit-base-patch32
.Run:
python app.py
Note: Default model is vit_b
,the demo can run on CPU. Default device is cpu
。
TODO
Video segmentation
Add text prompt
Add segmentation prompt (point and box)
Reference
- Thanks to the wonderful work Segment Anything and OWL-ViT
- Some video processing code references kadirnar/segment-anything-video, and some OWL-ViT code references ngthanhtin/owlvit_segment_anything.