rmokady / CLIP_prefix_caption

Simple image captioning model
MIT License
1.29k stars, 214 forks

Low variety of outputs. #50

Open hideosnes opened 2 years ago

hideosnes commented 2 years ago

Hi there, here are my issues:

a) I'm getting very little variety in the outputs, even with very different custom images, e.g. "...sitting on a cellphone", "...with a cellphone", "...a cellphone and a surfboard".

b) The captions don't change across subsequent runs on the same image. Is this an issue with the model or with the weights?

c) Although I changed 'entry_length' and 'stop_token' (and also the temperature and p), it has no effect on the caption whatsoever.
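One possible explanation for (b) and (c), offered as a sketch and not as a diagnosis of this repository's code: if a decoding loop picks the highest-scoring token at each step (argmax) instead of sampling, the result is deterministic. Temperature only rescales the scores and a top-p cutoff never removes the top token, so neither can change an argmax choice. The toy example below uses only the Python standard library; the function names and the toy logits are hypothetical and are not taken from the repository.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities after dividing by the temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def top_p_filter(probs, top_p):
    """Keep the smallest set of most likely tokens whose total mass reaches top_p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break
    return kept


def pick_greedy(logits, temperature=1.0, top_p=0.8):
    """Argmax over the filtered tokens: always the single most likely token."""
    probs = softmax(logits, temperature)
    kept = top_p_filter(probs, top_p)
    return max(kept, key=lambda i: probs[i])


def pick_sampled(logits, temperature=1.0, top_p=0.8, rng=random):
    """Draw one token at random from the filtered tokens, weighted by probability."""
    probs = softmax(logits, temperature)
    kept = top_p_filter(probs, top_p)
    weights = [probs[i] for i in kept]
    return rng.choices(kept, weights=weights, k=1)[0]


# Hypothetical scores for a four-token vocabulary.
toy_logits = [2.0, 1.5, 0.3, -1.0]

# Greedy choice under several temperature / top_p settings: always index 0.
greedy_picks = {
    pick_greedy(toy_logits, t, p)
    for t in (0.5, 1.0, 2.0)
    for p in (0.5, 0.8, 0.95)
}

# Sampled choice with a fixed seed: more than one distinct token shows up.
rng = random.Random(0)
sampled_picks = {pick_sampled(toy_logits, 2.0, 0.95, rng) for _ in range(200)}

print("greedy:", greedy_picks)
print("sampled:", sampled_picks)
```

If the generation function in use behaves like `pick_greedy`, swapping the final selection step for a weighted random draw as in `pick_sampled` would be one way to get captions that vary between runs and respond to temperature and p. That would not by itself explain (a), which may depend on the weights or on how the images are preprocessed.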

Does anyone have an idea what I'm doing wrong, or could you point me in the right direction for research?

Thank you in advance and all the best, Hidéo