-
Hi, guys, thanks for your work.
I got a question: the fixed policy templates are too long, which can seriously affect the speed of model inference, have you considered optimisation methods?
Is it po…
-
Hello. Thank you for your excellent work.I have some questions about the statements in the paper and hope to receive your answers。In Table 3, you compared the differences between your method and other…
-
### Describe the issue
Issue:
Command:
```
Bash pretrain.sh on my fineunted Llama2 model.
```
Log:
```
You should probably TRAIN this model on a down-stream task to be able to use it for…
-
Paper : [https://arxiv.org/pdf/2406.16860](https://arxiv.org/pdf/2406.16860)
Website : [https://cambrian-mllm.github.io](https://cambrian-mllm.github.io)
Code : [https://github.com/cambrian-mllm/cam…
-
Thanks for the great repository. I finetuned a LLaVA-Next-Video model and I was wondering if it is possible to infer it via LLaVA Next demo [script](https://github.com/LLaVA-VL/LLaVA-NeXT/tree/video_i…
-
Hello, author.
When running the inference demo of the model "lmms-lab/LLaVA-Video-7B-Qwen2," an error occurred while loading the vision tower (siglip-so400m-patch14-384):
File "/home/jeeves/…
-
Hi, thank you for your great work! Following your setup instruction, I ran the commands below.
```sh
conda env create -f cola.yml
cd ..
git lfs clone https://huggingface.co/OFA-Sys/ofa-large
pyt…
-
Currently clip.cpp uses linear interpolation in image preprocessing. The original implementation uses the bicubic interpolation from Pillow. It needs refactoring from Pillow https://github.com/python-…
-
### Describe the issue
Hello, there is currently a self built dataset for 80K object detection, which is used to detect the position of objects in the image. The image size is 1920x1080. When I use t…
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch…