I am seeking to utilize the llava-next-110b model to generate complex descriptions for an input image. Having tested numerous images, I've encountered instances where the inference result is empty, indicated as ['']. Could you please enlighten me on the possible reasons behind receiving an output of [''], thank you.
I am seeking to utilize the llava-next-110b model to generate complex descriptions for an input image. Having tested numerous images, I've encountered instances where the inference result is empty, indicated as ['']. Could you please enlighten me on the possible reasons behind receiving an output of [''], thank you.