microsoft / GenerativeImage2Text

GIT: A Generative Image-to-text Transformer for Vision and Language
MIT License
545 stars 68 forks source link

digital art selected for the # #55

Closed urlhearts closed 1 month ago

urlhearts commented 1 year ago

some result is digital art selected for the # , why?

amsword commented 1 year ago

can you provide more context, e.g. the command line you are using, the result you gets, the image you tested. My guess is that you were using the ptretrained model? the pretrained model was trained on quite noisy datasets, which means the captioning result from such pretrained checkpoint will be quite noisy.