The code below doesn't work either (it causes a shape error during sampling):
garment_condition = tform(Image.open(garment_condition_path).resize((224,224), Image.LANCZOS).convert("RGB"))
text_features = clip_model3(garment_condition.unsqueeze(0).cuda(non_blocking=True))
text_features = text_features.last_hidden_state
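One possible reason for the shape error, offered as a guess because the thread does not say which checkpoint `clip_model3` is: the `last_hidden_state` of a CLIP vision encoder generally has a different sequence length, and often a different hidden width, than the text encoder output that stage 1 was trained on. The sketch below only does the arithmetic for the standard OpenAI CLIP ViT variants at 224x224 input. `vit_token_count` and the `CLIP_VARIANTS` table are hypothetical helpers written for illustration, not part of the repository.

```python
# Rough shape arithmetic for CLIP encoders (illustrative helper, not repo code).

TEXT_TOKENS = 77  # context length of the CLIP text encoder


def vit_token_count(image_size: int, patch_size: int) -> int:
    """Tokens emitted by a ViT encoder: one per image patch plus one class token."""
    patches_per_side = image_size // patch_size
    return patches_per_side ** 2 + 1


# (patch size, vision hidden width, text hidden width) for standard CLIP configs.
CLIP_VARIANTS = {
    "ViT-B/32": (32, 768, 512),
    "ViT-B/16": (16, 768, 512),
    "ViT-L/14": (14, 1024, 768),
}

for name, (patch, vision_width, text_width) in CLIP_VARIANTS.items():
    image_tokens = vit_token_count(224, patch)
    print(
        f"{name}: image last_hidden_state = (1, {image_tokens}, {vision_width}), "
        f"text last_hidden_state = (1, {TEXT_TOKENS}, {text_width})"
    )
```

If the last dimension of the image features differs from what the stage 1 model expects for its text condition, sampling would likely fail with a shape error unless a projection layer is added or the model is retrained on image features.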
Currently, I have only released the version that uses text to train stage 1. If you want to use image CLIP, I think you will need to train a model yourself.
I succeeded in running inference on your work with text CLIP in stage 1, but I got blurry results when using image CLIP.
I tried the code below.
Could you please help me use image CLIP?