CatVTON is a simple and efficient virtual try-on diffusion model with 1) a Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters), and 3) Simplified Inference (< 8G VRAM at 1024×768 resolution).
How to get pairs from DeepFashion (In-shop Clothes Retrieval)? #53
How were the person image pairs collected from the DeepFashion dataset? Is a pair defined as two images of the same individual wearing the same garment in different poses?