-
File "/home/ma-user/anaconda3/envs/llava/lib/python3.10/site-packages/transformers/configuration_utils.py", line 264, in __getattribute__
return super().__getattribute__(key)
AttributeError: '…
-
### Feature request
This is a tracker issue for work on _interleaved_ in-and-out image-text generation.
There are now >= 5 open-source models that can do _interleaved_ image-text generation--and…
-
### Describe the issue
Issue:
I try to run your given example: Detect the person and frisbee in the image in detection examples. Sometime i meet error: SyntaxError: invalid syntax
Log:
```
2…
-
The [announcement blog post](https://llava-vl.github.io/blog/2024-04-30-llava-next-video/) indicates inference can be done with sglang, but attempting to load the 7b model with the sglang backend:
…
-
1.5阶段不是加入了中文ocr数据么,为什么识别中文依旧没有任何效果
-
In the file conversation.py, the Llama-3 chat is given by the line 107
self.tokenizer.apply_chat_template(chat_template_messages, tokenize=False, add_generation_prompt=False)
which means the token …
-
### What model would you like?
Add Video-LLaVA to be then used easily
https://github.com/PKU-YuanGroup/Video-LLaVA/tree/main
-
### Question
Hello LLaVA Team,
I've been working on fine-tuning the LLaVA v1.5-7B model on a custom dataset using the provided `finetune_task_lora.sh` script. Here is the configuration I used:
`b…
-
您好,请教您一下,您对训练数据json修改的方式的是否是多张相同的图片?
因为我看只有一个。
如果要实现两个不同的图片,是不是构建数据集应该要和 这种形式?
ps:纯小白,刚开始看MLLM,像迁移到自己的任务上。见谅!
-
### Motivation
a relatively lightweight, but the effect is very good multimodal model
https://github.com/OpenBMB/MiniCPM-V
### Related resources
_No response_
### Additional context
_No response…