Open AliAbdulRehman opened 7 months ago
Question
Can the chatbot have multiple sequential images as input? I'm trying to predict the pedestrian trajectory and my inputs are multiple frames of a video. How can the model understand sequential images?
No, it can't. Have a look at the Video-LLaVA repo.
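If you do go the video-model route, a common preprocessing step is to pick a fixed number of evenly spaced frames from the clip before handing them to the model. Here is a minimal sketch of that idea. `sample_frame_indices` is a hypothetical helper, not part of LLaVA or Video-LLaVA, and how many frames a given model expects is an assumption you should check against its own documentation.

```python
def sample_frame_indices(num_frames: int, num_samples: int) -> list[int]:
    """Return evenly spaced frame indices for a clip of `num_frames` frames.

    Hypothetical helper for illustration only; it is not taken from any
    LLaVA or Video-LLaVA code.
    """
    if num_frames <= 0 or num_samples <= 0:
        raise ValueError("num_frames and num_samples must be positive")
    # Short clip: keep every frame rather than repeating any.
    if num_samples >= num_frames:
        return list(range(num_frames))
    # Split the clip into equal segments and take the middle of each one.
    step = num_frames / num_samples
    return [int(step * i + step / 2) for i in range(num_samples)]


if __name__ == "__main__":
    # An 80-frame clip reduced to 8 frames.
    print(sample_frame_indices(80, 8))
```

The indices keep their temporal order, so the selected frames can be decoded and passed on in sequence.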