liusida / ComfyUI-B-LoRA

A ComfyUI custom node that loads and applies B-LoRA models.
MIT License

ComfyUI-B-LoRA



What is B-LoRA?

B-LoRA implicitly decomposes a single image into its style and content representations. With these, we can perform high-quality style-content mixing and even swap the style and content between two stylized images.

Advantages of B-LoRA

  1. You can apply Style, Content, or both.

  2. Model files are much smaller (about 100 MB for SDXL B-LoRAs).

  3. Training one B-LoRA needs only a single image as the training dataset and about 15 minutes on a single RTX 4090.
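
To illustrate the first point, here is a minimal sketch of how the Style and Content parts of one B-LoRA could be separated by filtering its weight keys. The two block names in `BLOCKS` are an assumption (they follow the key naming I believe the B-LoRA reference code uses for SDXL), and `filter_blora` is a hypothetical helper, not necessarily what the Load B-LoRA node does internally.

```python
# Assumed key substrings for SDXL B-LoRA weights (not confirmed by this README).
BLOCKS = {
    "content": ["unet.up_blocks.0.attentions.0"],
    "style": ["unet.up_blocks.0.attentions.1"],
}


def filter_blora(state_dict, parts=("content", "style")):
    """Return only the entries whose key belongs to one of the requested parts.

    `state_dict` maps weight names to tensors (any value type works here).
    `parts` is an iterable containing "content", "style", or both.
    """
    wanted = [substring for part in parts for substring in BLOCKS[part]]
    return {
        key: value
        for key, value in state_dict.items()
        if any(substring in key for substring in wanted)
    }


# Toy state dict with placeholder values instead of real tensors.
example = {
    "unet.up_blocks.0.attentions.0.transformer_blocks.0.attn1.to_k.lora.down.weight": 1,
    "unet.up_blocks.0.attentions.1.transformer_blocks.0.attn1.to_k.lora.down.weight": 2,
    "unet.down_blocks.1.attentions.0.transformer_blocks.0.attn1.to_k.lora.down.weight": 3,
}

style_only = filter_blora(example, parts=["style"])
both = filter_blora(example)
```

With a real file, the same filter would be applied to the loaded weights before they are merged into the model, so that only the chosen part affects generation.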

Please share your B-LoRA models on Civitai or Hugging Face!

Node

Load B-LoRA

Workflow Examples

A Single Load B-LoRA node

workflow 🌟 <s> is the training prompt for the B-LoRA colorful-squirrel.

A B-LoRA for Style, and another for Content

workflow 🌟 <s> is the training prompt for one B-LoRA, colorful-squirrel, and <p> is the training prompt for the other, pencil-boy.
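
As a rough illustration of how the two training prompts can be combined in the text prompt, the sketch below builds a prompt string from a content token and an optional style token. The template "A ... in ... style" and the helper `build_prompt` are assumptions made for this example; the actual workflows may word their prompts differently.

```python
def build_prompt(subject, style_token=None):
    """Compose a text prompt from a subject and an optional style token.

    `subject` can be a content training prompt such as "<p>" or a plain word.
    `style_token` is a style training prompt such as "<s>", or None.
    The template itself is a hypothetical choice for illustration.
    """
    if style_token:
        return f"A {subject} in {style_token} style"
    return f"A {subject}"


# Content from pencil-boy (<p>) rendered in the style of colorful-squirrel (<s>).
mixed = build_prompt("<p>", "<s>")

# Style only: an ordinary subject rendered in the colorful-squirrel style.
styled_dog = build_prompt("dog", "<s>")

print(mixed)
print(styled_dog)
```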

B-LoRA models used in the workflows can be downloaded here:

https://huggingface.co/sida/B-LoRA-examples/tree/main

More pretrained B-LoRAs to try out:

https://huggingface.co/lora-library?sort_models=downloads#models

Train Your B-LoRAs (WIP):

I'm building a Docker image for training. Please check train to see the current progress.

Credit goes to:

Citation

If you use B-LoRA in your research, please cite the authors' paper:

@misc{frenkel2024implicit,
      title={Implicit Style-Content Separation using B-LoRA}, 
      author={Yarden Frenkel and Yael Vinker and Ariel Shamir and Daniel Cohen-Or},
      year={2024},
      eprint={2403.14572},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}