Low variety of outputs.

Heihei, here's my issues:

a) I'm getting a very low variety of outputs with ([very] different) custom images, e.g. "...sitting on a cellphone", "...with a cellphone", "...a cellphone and a surfboard".

b) The captions don't change after subsequent runs. Is this an issue with the model or with weights?

c) Allthough I altered the 'entry_length' and 'stop_token' (and also temperatur and p) it has no effect on the caption whatsoever.

Has any1 an idea what I'm doing wrong or could you point me in the right direction for research?

Thank you in advance and all the best, Hidéo

rmokady / CLIP_prefix_caption

Low variety of outputs. #50