Nerogar / OneTrainer

OneTrainer is a one-stop solution for all your stable diffusion training needs.
GNU Affero General Public License v3.0
1.37k stars 118 forks source link

Captions generated are very short #84

Open Aamir3d opened 7 months ago

Aamir3d commented 7 months ago

Hello, I was using BLIP captioning and I found that captions are getting cut off rather than being complete captions. Is there a way to extend the length of the tokens/captions like we can do in Kohya or CLIP Interrogator?

Example. OneTrainer Caption: a woman in a long dress, holding a wand and pointing at the

Clip Interrogator 2 Fast mode: https://huggingface.co/spaces/fffiloni/CLIP-Interrogator-2 a woman standing on top of a mountain holding a wand, digital art fantasy art, digital art fantasy, very beautiful fantasy art, greek myth digital painting, fantasy digital art, very beautiful digital art, sky witch, beautiful fantasy art, maya ali as a wind mage, [[fantasy]], detailed cover artwork, fantasy digital painting, dreamlike digital painting

aliftadvantage commented 5 months ago

Wanted to follow up on this, I think the default setting for BLIP is 20 tokens. Maybe we could add a parameter to allow us to adjust this?