-
![bilde](https://github.com/ceruleandeep/ComfyUI-LLaVA-Captioner/assets/112418655/75ae7572-215a-409e-94b8-82875ac2b2ea)
Love this node!
It would be a great boon if when captioning a list of images…
-
Hi! Thanks for your great work! I am curious about how to get multi-view object images in the "Object Caption" step of your annotation pipeline. It seems that only a 3D point cloud and object bounding…
-
Hello, I wondered if there was a way to train the model and new data. I am sorry if I missed the documentation somewhere.
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What would your feature do ?
[This hypernetwork extension](https://githu…
-
Hi,
Thanks for sharing your great paper and code!
I am wondering about a use case on retrieval only mode (without dialogue or question ansewring).
is training the "Image-captioning" model benefit…
-
Hi,
I'm trying to reproduce the results reported on "InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning". But, I'm facing difficulty reproducing the InstructBLIP …
-
from the official coco website
Images
2014 Train images [83K/13GB]
2014 Val images [41K/6GB]
2014 Test images [41K/6GB]
2015 Test images [81K/12GB]
2017 Train images [118K/18GB]
2017 Val imag…
-
Training the model...
epoch: 0%| | 0/100 [00:00
-
I'm running on a remote server with shared=True enabled. I click "Add AI Captions with Florence-2", it goes to "Processing" and it never finishes. No errors on the console, no timeouts, it just neve…
fase updated
13 hours ago
-
Excuse me, author!
May I ask if this supports image captioning tasks, especially for generating text descriptions of fine-grained images?
Thank you! Looking forward to your reply!