Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
https://minigpt-4.github.io
BSD 3-Clause "New" or "Revised" License
25.45k stars 2.92k forks source link

Ask anything in video #49

Open Andy1621 opened 1 year ago

Andy1621 commented 1 year ago

Hi! We have simply extended MiniGPT-4 for video question answering in our project Ask-Anything. Without extra instruction fine-tuning, current results are not satisfactory. image

In our other try, we simply encode the video as captions, and input them with ChatGPT, which provides better results. image

Now we are trying to build a real video ChatBot with fantastic techniques as used in MiniGPT-4 and Llava. Hopefully, everyone can try our demo, and find the problem, we will try our best to fix it in our future ChatBot.

2132660698 commented 1 year ago

greate work!

TsuTikgiau commented 1 year ago

Cool!

0000070 commented 1 year ago

小黑子食不食油饼,树枝666

Andy1621 commented 1 year ago

你干嘛~

小黑子食不食油饼,树枝666