Closed godimarcovr closed 8 months ago
For 1 - explained here in README: https://github.com/hotshotco/Hotshot-XL#text-to-gif-with-personalized-loras. Use the --spatial_unet_base="path/to/stabilityai/stable-diffusion-xl-base-1.0/unet" \
parameter. If you are using the base Hotshot-XL model (not fine tuned at higher resolutions), we'd recommend using some base u-net that has been trained at or around the 512 aspect ratio.
We've tested LoRA compatibility with diffusers format LoRAs. Sounds like the keys are mismatching because your LoRAs are safetensors format? We hope to add support for other LoRA formats soon, and would also greatly appreciate any help in PRs!
Hello, thanks for this amazing project. You mention in the README
I can't find out how to achieve n.1, which argument should be used to use a fine-tuned SDXL model? As for n.2, I have been trying to follow the instructions in the relevant section, thanks to the latest commit I am able to load the UNet from stabilityai/stable-diffusion-xl-base-1.0 in safetensors, but when I try to use a LoRA I get an error: "The following keys have not been correctly be renamed" followed by a bunch of state_dict keys such as "lora_te1_text_model_encoder" keys, "lora_te2_text_model_encoder" keys and "loraunet" keys. Maybe some renaming is needed?
For reference, I am mainly interested in using SDXL finetunes and loras from civitai.com such as https://civitai.com/models/131243/robocop for example, which are in .safetensors format.
Thank you for the great work!