TTS with 80M parameters
Text Behind Image using birefnet-lite for background removal
Audio-based Lip Sync for Talking Head Video Editing
Apache Licensed Advanced Video Generation Model
Depth Any Video with Scalable Synthetic Data
State-of-the-art open-vocabulary image segmentation โก๏ธ
Import a portrait, click to move the head!