paulo-coronado / comfy_clip_blip_node

28 stars 6 forks source link

A ComfyUI Node for adding BLIP in CLIPTextEncode

Announcement: BLIP is now officially integrated into CLIPTextEncode

Dependencies

Local Installation

Inside ComfyUI_windows_portable\python_embeded, run:

python.exe -m pip install fairscale

And, inside ComfyUI_windows_portable\ComfyUI\custom_nodes\, run:

git clone https://github.com/paulo-coronado/comfy_clip_blip_node

Google Colab Installation

Add a cell with the following code:

!pip install fairscale
!cd custom_nodes && git clone https://github.com/paulo-coronado/comfy_clip_blip_node

How to use

  1. Add the CLIPTextEncodeBLIP node;
  2. Connect the node with an image and select a value for min_length and max_length;
  3. Optional: if you want to embed the BLIP text in a prompt, use the keyword BLIP_TEXT (e.g. "a photo of BLIP_TEXT", medium shot, intricate details, highly detailed).

Acknowledgement

The implementation of CLIPTextEncodeBLIP relies on resources from BLIP, ALBEF, Huggingface Transformers, and timm. We thank the original authors for their open-sourcing.