Closed: mridulbirla closed this issue 1 year ago
Thank you for bringing this to our attention. We appreciate your effort in testing and sharing the feedback.
Upon reviewing the case you've presented, we've observed similar issues not only with Cheetah but also with other multimodal LLMs. It appears to be a more generalized problem for multimodal LLMs.
Rest assured, we recognize the importance of addressing this issue and will work towards finding a solution in upcoming updates.
I tried testing this with the sample image below and a modified test_cheetah_llama2.py. The output looks super weird. Is there something I am doing wrong, or do you also encounter the same kind of issue? I am using Llama 2.