LargeWorldModel / LWM

Large World Model -- Modeling Text and Video with Millions Context
https://largeworldmodel.github.io/
Apache License 2.0
7.15k stars 552 forks source link

How to setup conversation with vision chat? #56

Open kylekam opened 8 months ago

kylekam commented 8 months ago

I'm currently able to use run_vision_chat.sh with a limited number of video frames being passed in for a single text query. The text result is output from the model and then the process ends. However, the paper shows examples of a continuous dialogue about a video and I was wondering if it's possible to set this up.

infocodiste commented 8 months ago

@kylekam what tpu version are using?