alibaba / AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Apache License 2.0
1.98k stars 291 forks source link

How can I do inference on a single image for VQA task? #70

Open abhipn opened 1 year ago

abhipn commented 1 year ago

Documentation seems bit rough. Not able to find how I can just do inference. If someone has an example code for inference for VQA task, please do share.

kenhuang1964 commented 1 year ago

Hey @abhipn, did you end up figuring out how to do this?