Open xddun opened 1 year ago
The README.md said that "Given an input image, we first generate text captions using BLIP and apply regularized DDIM inversion to obtain our inverted noise map."
so you actually need to install BLIP, through
pip install salesforce-lavis
我也是和你卡在一样的位置,但是我是报显存不够gpu 12G显存不够推理吗? I'm also stuck in the same position as you, but I'm reporting that the video memory is not enough gpu 12G video memory is not enough to reason?
@xddun Hello, I am also stuck in the same place and unable to continue. Have you solved this problem? If so, could you give me some advice?
I have encountered a strange problem, always stuck at 48/50. I suspect that the version of a certain package is incorrect. Can the author provide the version of the package in the environment. 我遇到了一个奇怪的问题,总是卡在48/50。我怀疑是某个包的版本不对,作者可以给一下环境里的包的版本吗。
The following is the log, which stops every time it reaches 48/50. 下面是日志,每次运行到48/50 就停止了。
(ldm) xiedong@gpu20:/ssd/xiedong/home/InST-main/pix2pix-zero-main$ python src/inversion.py --input_image "assets/test_images/cats/cat_1.png" --results_folder "output/test_cat" text_encoder/pytorch_model.fp16.safetensors not found Fetching 20 files: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 20/20 [00:00<00:00, 141460.51it/s] The config attributes {'scaling_factor': 0.18215} were passed to AutoencoderKL, but are not expected and will be ignored. Please verify your config.json configuration file. 96%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████▎ | 48/50 [00:09<00:00, 5.09it/s]
Here is the version of my package.
下面是我的包的版本。