kabachuha / sd-webui-text2video

Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies

Add Stable LoRA support #201

Closed ExponentialML closed 1 year ago

ExponentialML commented 1 year ago


This currently only works with ModelScope-based models. Stable LoRA is simply a version that uses Microsoft's official implementation, and is made specifically for Stable Diffusion based models. While it is still in preview in its home repository, this release is fully functional.

Please read the open PR on the finetuning repository for details and tracking: https://github.com/ExponentialML/Text-To-Video-Finetuning/pull/90#issue-1795207411

In short, this allows you to use LoRA models on the fly during inference. The only supported files are the ones trained with the aforementioned repository.
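To illustrate what applying a LoRA "on the fly" generally means, here is a minimal pure-Python sketch of the standard LoRA formulation: the base weights stay untouched and a scaled low-rank update is added at inference time. This is not the code from this PR; the names `matvec`, `lora_forward`, `lora_a`, `lora_b`, `alpha`, and `enabled` are hypothetical and chosen only for illustration.

```python
# Illustrative sketch of a LoRA-adapted linear layer (not this PR's code).

def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of numbers)."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def lora_forward(weight, lora_a, lora_b, x, alpha=1.0, enabled=True):
    """Return W x, plus the low-rank update (alpha / r) * B (A x) when enabled.

    weight: out_dim x in_dim base weights (never modified)
    lora_a: r x in_dim down-projection
    lora_b: out_dim x r up-projection
    """
    base = matvec(weight, x)
    if not enabled:
        # With the LoRA switched off, the layer behaves like the base model.
        return base
    rank = len(lora_a)
    scale = alpha / rank
    update = matvec(lora_b, matvec(lora_a, x))
    return [b + scale * u for b, u in zip(base, update)]
```

Because the base weights are never overwritten, a LoRA can be toggled or swapped between generations without reloading the underlying model, which is the general idea behind using one during inference.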

What you cannot do:

Dev notes:

I've also added an extension helper to this release. It should serve as a simple baseline for making any other extensions.

How to use

Simply place the trained LoRA files in your webui LoRA models directory. Everything else will be taken care of, and the files will show up in the list.
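If you would rather script the copy step, a minimal sketch could look like the following. It assumes the webui's LoRA directory is `models/Lora` under the webui root (the usual Auto1111 default, but check your own install) and that the trained files end in `.pt` or `.safetensors`; both are assumptions, and the helper name `install_lora_files` is hypothetical.

```python
import shutil
from pathlib import Path


def install_lora_files(trained_dir, webui_root,
                       lora_subdir="models/Lora",
                       suffixes=(".pt", ".safetensors")):
    """Copy trained LoRA files into the webui LoRA directory.

    lora_subdir and suffixes are assumptions; adjust them to your setup.
    Returns the names of the files that were copied.
    """
    target = Path(webui_root) / lora_subdir
    target.mkdir(parents=True, exist_ok=True)
    copied = []
    for src in sorted(Path(trained_dir).iterdir()):
        # Skip subdirectories and anything that is not a weights file.
        if src.is_file() and src.suffix.lower() in suffixes:
            shutil.copy2(src, target / src.name)
            copied.append(src.name)
    return copied
```

For example, `install_lora_files("outputs/my_lora", "stable-diffusion-webui")` would copy the matching files from a hypothetical training output folder into the webui install.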

This PR is ready to go to market, as the training code is already available in the finetuning repository (just pull the PR). For ease of use by non-developers, however, it will need to be held until the finetuning repository PR is completed.

UPDATE:

The finetune PR is ready to go. I'm just testing for any last bugs before committing.

UPDATE 2: The finetune PR is now merged into main.