-
### Is your enhancement related to a problem? Please describe.
Related to #826 and alongside adding OpenAI to alt text generation, let's look to update the Azure API version in ClassifAI.
While ther…
-
### What needs to change?
Similar to #1268, we will need to develop a computer vision model to detect the red, green, and blue tins that lay on the drone landing pads. To the down camera, these will …
-
#### Issue
I'm not sure about the proper workflow to use with interpreter vision after reading [this](https://changes.openinterpreter.com/log/local-iii). For the record, I separately installed moondr…
-
### Just Dire Things version
1.4.5
### Minecraft Version
1.21.1
### (Neo)Forge Version
21.1.69
### Modpack & Version
Direwolf20 - 1.21 (1.3.0)
### Do you have optifine or Rubid…
-
When loading an image into a vision model, the LLM will reply with some version of "there is no image" or "As an AI, I can only work with text". This happens even though the model has vision capabilit…
-
看原文应该是tune了vision tower的,但是在lmms-lab/llava-onevision-qwen2-7b-ov的config.json中,有`"mm_vision_tower": "google/siglip-so400m-patch14-384"`。看上去是加载了原始的vision tower。
这里有个问题,不知道是不是先加载原始的vision tower然后再进行的参数覆…
-
Would be nice to have this one integrated to Quartz Solar Forecast
refer to the docs: https://documentation.auroravision.net/index.html%3Fp=31.html
please add the integration in : https://github.c…
-
The prototype token should be updated with the appropriate vision mode when a feature is added to the character.
The example below is specific to darkvision, but a fix should apply to all special vis…
-
Hi,
Thanks for the great work. Is there any example how this could be used with standard (Pytorch) Vision Transformers?
Many thanks,
Sid
-
### Description
Using structured output with vision models like gpt-4o-mini works. I'd like to do the same for Llama-3.2-11B-Vision-Instruct from GitHub models. Currently it throws an exception.
##…