-
I use your program all the time for captioning new dataset images, however I sometimes misclick and accidentally crop or resize images. I normally leave resizing to after I have finished the dataset, …
-
I read in the readme file, paligemma can captioning a short video, anyone can guide me to do that?
Does it extract every frames on the video? Or does the paligemma tokenizer directly support video…
-
How do you add metadata to ImageClassification tasks as shown in the Demo ImageClassification task below? It seems you can only upload images.
![image](https://user-images.githubusercontent.com/567…
-
Hi! I have been having some trouble to get the repo and models working. Specially, I tried to run the evaluation scripts (specifically COCO captioning) as reported in the README using the checkpoint t…
-
**Motivation**
Improve the benchmark performance of all algorithms based on TextOCR dataset released by Facebook AI research team
**Related resources**
https://textvqa.org/textocr
**Overvi…
-
Dear Authors,
Thanks for your great work! I was wondering if it would be possible for you to release the bounding box coordinates for each image used in MedTrinity-25M. Access to this information w…
-
`!!! Exception during processing !!! cannot access local variable 'image1' where it is not associated with a value
Traceback (most recent call last):
File "C:\Users\Robert\Downloads\ComfyUI_window…
-
Using the old method, I get the following error when trying to run the "start dreambooth" code
I have run the dependencies block twice
```
The following values were not passed to `accelerate la…
-
List of images that will need additional attention during contextualization
**Lesson 1**
- [ ] [Image](https://github.com/nasa/Transform-to-Open-Science/blob/open-science-101/Module_4/images/med…
-
HI, the predicted results using checkpoint-29-66420 are wrong, like:
391895 [{"caption": "nearly marcia drippedtangletangleyo pat hypothetical hyper pat hypothetical hypothetical hyderabadtangleyo p…