-
Incomplete Image Caption and Excessively Long Inference Time using halc:
When I use the Minigpt4 model to generate an Image Caption with the halc method using your provided code, it not only genera…
-
[Generating Visual Explanations](https://link.springer.com/chapter/10.1007/978-3-319-46493-0_1)
Clearly explaining a rationale for a classification decision to an end user can be as important as the …
-
**Motivation**
Improve the benchmark performance of all algorithms based on TextOCR dataset released by Facebook AI research team
**Related resources**
https://textvqa.org/textocr
**Overvi…
-
Traceback (most recent call last):
File "/mnt/Pycharm_Remote/DLCT_test/train.py", line 335, in
scores = evaluate_metrics(model, dict_dataloader_val, text_field)
File "/mnt/Pycharm_Remote/DLCT_test…
-
### Metadata
- Authors: Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh
- Organization: Georgia Institute of Technology & Facebook AI Research
- Conference: CVPR 2018
- Paper: https://arxiv.org/…
-
Hello
I ran the demo.py for an image and it works. Now trying to do captioning on a list of images
**code snippet:**
captions_LLAMA = []
for image in igs_trnsfmd:
caption = model.generat…
-
Hi,
For reasons of reproducibility, it would be great if you provided source code to reproduce the results on ScienceQA.
Thanks.
-
Hello!
Great work! Was this model trained for classification? Not sure, but if it was trained for some task, then it should contain linear layers, pooling layers, which can be removed if I want to …
-
**Is your feature request related to a problem? Please describe.**
I have been actively using this repository for multimodal training involving images and text. It has been incredibly helpful for my …
-
Hello, recently I am building a network that can produce both masks and bounding box level captions. I refer to the [mask rcnn](https://arxiv.org/pdf/1703.06870.pdf) and [densecap](https://arxiv.org/…
Askfk updated
3 years ago