visual-captioning Search Results

473 results
for visual-captioning

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

BillChan226/HALC #17

Incomplete Image Caption and Excessively Long Inference Time

Incomplete Image Caption and Excessively Long Inference Time using halc: When I use the Minigpt4 model to generate an Image Caption with the halc method using your provided code, it not only genera…

pspdada updated 5 hours ago
4
dais-ita/interpretability-papers #38

Generating Visual Explanations

[Generating Visual Explanations](https://link.springer.com/chapter/10.1007/978-3-319-46493-0_1) Clearly explaining a rationale for a classification decision to an end user can be as important as the …

richardtomsett updated 6 years ago
1
open-mmlab/mmocr #259

Performance on TextOCR Dataset

**Motivation** Improve the benchmark performance of all algorithms based on TextOCR dataset released by Facebook AI research team **Related resources** https://textvqa.org/textocr **Overvi…

jkcg-learning updated 3 years ago
6
aimagelab/meshed-memory-transformer #60

A code question

Traceback (most recent call last): File "/mnt/Pycharm_Remote/DLCT_test/train.py", line 335, in scores = evaluate_metrics(model, dict_dataloader_val, text_field) File "/mnt/Pycharm_Remote/DLCT_test…

GX77 updated 2 years ago
17
howardyclo/papernotes #30

Neural Baby Talk

### Metadata - Authors: Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh - Organization: Georgia Institute of Technology & Facebook AI Research - Conference: CVPR 2018 - Paper: https://arxiv.org/…

howardyclo updated 5 years ago
8
OpenGVLab/LLaMA-Adapter #102

RuntimeError: The size of tensor a (730) must match the size…

Hello I ran the demo.py for an image and it works. Now trying to do captioning on a list of images **code snippet:** captions_LLAMA = [] for image in igs_trnsfmd: caption = model.generat…

parasmech updated 1 year ago
1
OpenGVLab/LLaMA-Adapter #6

Code for reproducing evaluation results on ScienceQA

Hi, For reasons of reproducibility, it would be great if you provided source code to reproduce the results on ScienceQA. Thanks.

TJKlein updated 1 year ago
9
shilrley6/Faster-R-CNN-with-model-pretrained-on-Visual-Genome #2

Extract model weights for other tasks

Hello! Great work! Was this model trained for classification? Not sure, but if it was trained for some task, then it should contain linear layers, pooling layers, which can be removed if I want to …

nilinykh updated 4 years ago
1
mlfoundations/open_flamingo #305

[FEATURE REQUEST] Enable Video Training

**Is your feature request related to a problem? Please describe.** I have been actively using this repository for multimodal training involving images and text. It has been incredibly helpful for my …

simplaj updated 3 months ago
2
matterport/Mask_RCNN #2294

Training on combined coco and visual genome dataset and then…

Hello, recently I am building a network that can produce both masks and bounding box level captions. I refer to the [mask rcnn](https://arxiv.org/pdf/1703.06870.pdf) and [densecap](https://arxiv.org/…

Askfk updated 3 years ago
3

上一页 1...2 3 4 5 6 7 8...48 下一页

473 results for visual-captioning

473 results
for visual-captioning