-
The speech-to-text workflow step that sends files to S3 (`fetch-files`) often raises AWS exceptions, which go away on retry. Need to debug.
See https://app.honeybadger.io/projects/52894/faults/11369050…
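While debugging, it may help to record each failed attempt before retrying, so the transient exceptions can be compared afterwards. Below is a minimal sketch, not the workflow's actual retry mechanism: `call_with_retries` and its parameters are hypothetical names, and `operation` stands in for whatever call `fetch-files` makes to S3.

```python
import logging
import time

logger = logging.getLogger("fetch_files_retry")


def call_with_retries(operation, max_attempts=3, base_delay=0.5, sleep=time.sleep):
    """Call operation(); on failure, log the exception and retry with exponential backoff.

    Hypothetical helper: `operation` is any zero-argument callable, e.g. a
    wrapper around the S3 upload performed by the workflow step.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return operation()
        except Exception as exc:  # assumption: narrow this to the AWS exception types actually observed
            # Log every failure so the transient errors can be inspected later.
            logger.warning("attempt %d/%d failed: %r", attempt, max_attempts, exc)
            if attempt == max_attempts:
                raise  # out of attempts: surface the last exception unchanged
            # Delay doubles each time: base_delay, 2 * base_delay, 4 * base_delay, ...
            sleep(base_delay * 2 ** (attempt - 1))
```

The `sleep` parameter is injectable only so the backoff can be exercised without real waiting; in normal use the default `time.sleep` applies.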
-
Hi! Thanks for your great work.
Could you please release the following evaluation scripts?
model_video_chatgpt_general.py
eval_activitynet_qa.py
model_video_detail_description.py
-
### Context
Beneficiary name _Rinku Jakesika: [DEA link](https://app.avniproject.org/#/app/subject?uuid=da186e1a-217d-4e28-bfb6-bda6eed304ec)
The issue exactly here is that the Height/Weight autopopu…
-
Thank you very much for sharing the beautiful code. I am an undergraduate very interested in video QA. Can you share the code for zero-shot video QA? Best Wishes!
-
Hi Team,
I saw that LLaVA-NeXT-Video-32B-Qwen obtains 77.31% and 63% accuracy on NeXT-QA and EgoSchema, respectively, here: https://huggingface.co/lmms-lab/LLaVA-NeXT-Video-32B-Qwen.
On the other hand, LLaVA-NeXT…
-
As an Idea Workshopper, I want to sort idea cards by which have the MOST votes to see what is popular, as well as by which have the LEAST votes so that I can see which cards might be newer or rec…
-
##### Environment
- [x] The MPD passes the DASH-IF Conformance Tool on https://conformance.dashif.org/
- [x] The stream has correct Access-Control-Allow-Origin headers (CORS)
- [x] There are no n…
-
# Request the Contact Center Review
## What this form is for
Use this template to request a Contact Center review of your product.
Please review the Self-service Product Guide Template [link](https…
-
For the example in this page: https://github.com/mit-han-lab/llm-awq/tree/main/tinychat#usage
You can easily run inference on images:
python vlm_demo_new.py \
--model-path VILA1.5-13b-AWQ \
…
-
Thank you for your great work! I was reading the paper and noticed that the average video duration in the VStream-QA dataset is mentioned as 40 minutes. However, after downloading the dataset, I obser…