title: Segment Anything
emoji: 🚀
colorFrom: gray
colorTo: pink
sdk: gradio
sdk_version: 3.24.1
app_file: app.py
pinned: false

Segment Anything WebUI


This project is based on the Segment Anything Model (SAM) by Meta. The UI is built with Gradio.

Change Logs

  • [2023-4-11]
    • Added video segmentation: SAM can automatically segment a short video.
    • Added text prompt segmentation using the OWL-ViT (Vision Transformer for Open-World Localization) model.

Usage

The following steps run the demo on your own computer.

  • Install Segment Anything:
pip install git+https://github.com/facebookresearch/segment-anything.git
  • Clone this repository:
git clone https://github.com/5663015/segment_anything_webui.git
  • Create a folder named checkpoints in the project root and put the downloaded SAM weight files in it. You can download the weights from the following URLs:

  • Under checkpoints, create a folder named models--google--owlvit-base-patch32 and put the downloaded OWL-ViT weight files in it.

  • Run:

python app.py
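
The folder layout described in the steps above can be prepared and checked before launching the app. The following is a minimal sketch using only the standard library. The helper name prepare_checkpoints is hypothetical and not part of this repository, and the assumption that weight files sit directly inside each folder is ours.

```python
from pathlib import Path

# Folder name taken from the usage steps; everything else is illustrative.
OWLVIT_DIR = "models--google--owlvit-base-patch32"


def prepare_checkpoints(project_root):
    """Create checkpoints/ and the OWL-ViT subfolder; list what is still missing."""
    checkpoints = Path(project_root) / "checkpoints"
    owlvit = checkpoints / OWLVIT_DIR
    owlvit.mkdir(parents=True, exist_ok=True)  # also creates checkpoints/

    missing = []
    # Assumption: SAM weights are regular files placed directly in checkpoints/.
    if not any(p.is_file() for p in checkpoints.iterdir()):
        missing.append("SAM weights in " + str(checkpoints))
    # Assumption: OWL-ViT weights are whatever is placed inside the subfolder.
    if not any(owlvit.iterdir()):
        missing.append("OWL-ViT weights in " + str(owlvit))
    return missing
```

Calling prepare_checkpoints(".") from the project root returns an empty list once both sets of weights are in place.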

Note: The default model is vit_b and the default device is cpu, so the demo can run on a CPU.
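
For illustration, this is roughly how a vit_b model could be loaded on the CPU with the segment-anything package. It is a sketch, not the code in app.py. The helpers find_checkpoint and load_sam are hypothetical, and the assumption that the weights file ends in .pth and contains the model type in its name is ours.

```python
from pathlib import Path


def find_checkpoint(checkpoints_dir, model_type="vit_b"):
    """Return the first weights file whose name contains model_type, else None."""
    # Assumption: the file is a .pth file named after the model type.
    matches = sorted(
        p for p in Path(checkpoints_dir).glob("*.pth") if model_type in p.name
    )
    return matches[0] if matches else None


def load_sam(checkpoints_dir="checkpoints", model_type="vit_b", device="cpu"):
    """Build a SAM model from local weights and move it to the given device."""
    # Imported lazily so the helper above works without the package installed.
    from segment_anything import sam_model_registry

    checkpoint = find_checkpoint(checkpoints_dir, model_type)
    if checkpoint is None:
        raise FileNotFoundError(f"no {model_type} weights in {checkpoints_dir}")
    sam = sam_model_registry[model_type](checkpoint=str(checkpoint))
    sam.to(device=device)
    return sam
```

Passing a different model_type or device to load_sam would select larger weights or a GPU, provided the matching file is present in checkpoints.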

TODO

  • Video segmentation

  • Add text prompt

  • Add segmentation prompt (point and box)

Reference