zhengchong
commited on
Commit
β’
f15e18d
1
Parent(s):
711be4b
Update README.md
Browse files
README.md
CHANGED
@@ -1,7 +1,6 @@
|
|
1 |
---
|
2 |
license: cc-by-nc-sa-4.0
|
3 |
---
|
4 |
-
|
5 |
# π CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
|
6 |
|
7 |
<div style="display: flex; justify-content: center; align-items: center;">
|
@@ -29,12 +28,14 @@ license: cc-by-nc-sa-4.0
|
|
29 |
</div>
|
30 |
|
31 |
|
32 |
-
|
33 |
**CatVTON** is a simple and efficient virtual try-on diffusion model with ***1) Lightweight Network (899.06M parameters totally)***, ***2) Parameter-Efficient Training (49.57M parameters trainable)*** and ***3) Simplified Inference (< 8G VRAM for 1024X768 resolution)***.
|
34 |
|
35 |
|
36 |
|
37 |
## Updates
|
|
|
|
|
|
|
38 |
- **`2024/08/10`**: Our π€ [**HuggingFace Space**](https://huggingface.co/spaces/zhengchong/CatVTON) is available now! Thanks for the grant from [**ZeroGPU**](https://huggingface.co/zero-gpu-explorers)οΌ
|
39 |
- **`2024/08/09`**: [**Evaluation code**](https://github.com/Zheng-Chong/CatVTON?tab=readme-ov-file#3-calculate-metrics) is provided to calculate metrics π.
|
40 |
- **`2024/07/27`**: We provide code and workflow for deploying CatVTON on [**ComfyUI**](https://github.com/Zheng-Chong/CatVTON?tab=readme-ov-file#comfyui-workflow) π₯.
|
@@ -47,7 +48,14 @@ license: cc-by-nc-sa-4.0
|
|
47 |
|
48 |
|
49 |
## Installation
|
50 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
51 |
|
52 |
## Deployment
|
53 |
### ComfyUI Workflow
|
@@ -63,7 +71,10 @@ To deploy CatVTON to your ComfyUI, follow these steps:
|
|
63 |
>
|
64 |
When you run the CatVTON workflow for the first time, the weight files will be automatically downloaded, usually taking dozens of minutes.
|
65 |
|
66 |
-
|
|
|
|
|
|
|
67 |
<!-- <div align="center">
|
68 |
<img src="resource/img/comfyui.png" width="100%" height="100%"/>
|
69 |
</div> -->
|
@@ -96,7 +107,7 @@ Once the datasets are downloaded, the folder structures should look like these:
|
|
96 |
β β β βββ [000006_00_mask.png | 000008_00.png | ...]
|
97 |
...
|
98 |
```
|
99 |
-
|
100 |
```
|
101 |
βββ DressCode
|
102 |
| βββ test_pairs_paired.txt
|
@@ -110,6 +121,11 @@ For the DressCode dataset, we provide [our preprocessed agnostic masks](https://
|
|
110 |
β β β βββ [013563_0.png| 013564_0.png | ...]
|
111 |
...
|
112 |
```
|
|
|
|
|
|
|
|
|
|
|
113 |
|
114 |
### 2. Inference on VTIONHD/DressCode
|
115 |
To run the inference on the DressCode or VITON-HD dataset, run the following command, checkpoints will be automatically downloaded from HuggingFace.
|
|
|
1 |
---
|
2 |
license: cc-by-nc-sa-4.0
|
3 |
---
|
|
|
4 |
# π CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
|
5 |
|
6 |
<div style="display: flex; justify-content: center; align-items: center;">
|
|
|
28 |
</div>
|
29 |
|
30 |
|
|
|
31 |
**CatVTON** is a simple and efficient virtual try-on diffusion model with ***1) Lightweight Network (899.06M parameters totally)***, ***2) Parameter-Efficient Training (49.57M parameters trainable)*** and ***3) Simplified Inference (< 8G VRAM for 1024X768 resolution)***.
|
32 |
|
33 |
|
34 |
|
35 |
## Updates
|
36 |
+
- **`2024/10/17`**:[**Mask-free version**](https://huggingface.co/zhengchong/CatVTON-MaskFree)π€ of CatVTON is release and please try it in our [**Online Demo**](http://120.76.142.206:8888).
|
37 |
+
- **`2024/10/13`**: We have built a repo [**Awesome-Try-On-Models**](https://github.com/Zheng-Chong/Awesome-Try-On-Models) that focuses on image, video, and 3D-based try-on models published after 2023, aiming to provide insights into the latest technological trends. If you're interested, feel free to contribute or give it a π star!
|
38 |
+
- **`2024/08/13`**: We localize DensePose & SCHP to avoid certain environment issues.
|
39 |
- **`2024/08/10`**: Our π€ [**HuggingFace Space**](https://huggingface.co/spaces/zhengchong/CatVTON) is available now! Thanks for the grant from [**ZeroGPU**](https://huggingface.co/zero-gpu-explorers)οΌ
|
40 |
- **`2024/08/09`**: [**Evaluation code**](https://github.com/Zheng-Chong/CatVTON?tab=readme-ov-file#3-calculate-metrics) is provided to calculate metrics π.
|
41 |
- **`2024/07/27`**: We provide code and workflow for deploying CatVTON on [**ComfyUI**](https://github.com/Zheng-Chong/CatVTON?tab=readme-ov-file#comfyui-workflow) π₯.
|
|
|
48 |
|
49 |
|
50 |
## Installation
|
51 |
+
|
52 |
+
Create a conda environment & Install requirments
|
53 |
+
```shell
|
54 |
+
conda create -n catvton python==3.9.0
|
55 |
+
conda activate catvton
|
56 |
+
cd CatVTON-main # or your path to CatVTON project dir
|
57 |
+
pip install -r requirements.txt
|
58 |
+
```
|
59 |
|
60 |
## Deployment
|
61 |
### ComfyUI Workflow
|
|
|
71 |
>
|
72 |
When you run the CatVTON workflow for the first time, the weight files will be automatically downloaded, usually taking dozens of minutes.
|
73 |
|
74 |
+
<div align="center">
|
75 |
+
<img src="resource/img/comfyui-1.png" width="100%" height="100%"/>
|
76 |
+
</div>
|
77 |
+
|
78 |
<!-- <div align="center">
|
79 |
<img src="resource/img/comfyui.png" width="100%" height="100%"/>
|
80 |
</div> -->
|
|
|
107 |
β β β βββ [000006_00_mask.png | 000008_00.png | ...]
|
108 |
...
|
109 |
```
|
110 |
+
|
111 |
```
|
112 |
βββ DressCode
|
113 |
| βββ test_pairs_paired.txt
|
|
|
121 |
β β β βββ [013563_0.png| 013564_0.png | ...]
|
122 |
...
|
123 |
```
|
124 |
+
For the DressCode dataset, we provide script to preprocessed agnostic masks, run the following command:
|
125 |
+
```PowerShell
|
126 |
+
CUDA_VISIBLE_DEVICES=0 python preprocess_agnostic_mask.py \
|
127 |
+
--data_root_path <your_path_to_DressCode>
|
128 |
+
```
|
129 |
|
130 |
### 2. Inference on VTIONHD/DressCode
|
131 |
To run the inference on the DressCode or VITON-HD dataset, run the following command, checkpoints will be automatically downloaded from HuggingFace.
|