ashutoshIITK closed this 7 months ago
Few-shot inference requires the ability to understand multiple images (few-shot examples given only as text may be possible). However, MoAI is not trained with multiple images and does not support multiple image tokens.
Got it! Thank you for the quick response!
Is it possible to feed one image and its caption, and then run another inference, as in few-shot inference?