-
Hi, thank you for your excellent work!
I noticed in Table 6 that the TGIF Flops=100% baseline accuracy is much lower than reported in the original paper:
- Video-LLaVA reports TGIF Accuracy/Score …
-
Hi there,
Could you please provide an example as how we should run Video Question Answering task using LAVIS?
Any examples about other video-related tasks well be very appreciated.
-
Dear Authors,
How can I use the Internvideo2 model for Video Question Answering or Summarization tasks given a video? Please provide a demo script if any for testing on new videos.
Thanks.
-
Agenda:
1. Overview of the [learning goals for last week](https://github.com/earthlab-education/Earth-Analytics-AY24/milestone/1) and [learning goals for this week](https://github.com/earthlab-educ…
-
### Prerequisites
- [X] I am running the latest code. Mention the version if possible as well.
- [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md)…
-
- [ ] Add an example of a good pull request description to [Best Practices / Pull Requests](https://assemblyline.suffolklitlab.org/docs/github#branches-pull-requests-and-commits).
- [ ] Document resp…
-
When calling, e.g., `textureSampleGrad(tex, sampler, st, dst_dx, dst_dy)`, is the effective sampled area of the texture a patch with "origin" at `st`, e.g. `st + J * xy`, with `xy` in `[0,1]^2` and J …
-
## Overview
When copying a password with CTRL + C the password is only cleared after the countdown if the KeepassXC window is in focus. If it is out of focus, the password remains in the clipboard in…
-
Thanks for your great work. The detailed facial expression editor is awesome. Is it possible to use it with video instead of image?
-
Hi @wang-sj16,
Niels here from the open-source team at Hugging Face. I found your work through ECCV (congrats!), and indexed your paper here: https://huggingface.co/papers/2311.13627, congrats on g…