-
Hello, I've recently been working on image-captioning tasks and noticed that you report CIDEr scores on COCO in your paper. However, I couldn't find any gold annotations for the COCO test set, so I won…
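Since the official COCO test-set annotations are not released, CIDEr is typically reported either through the online evaluation server or on a split with public references such as the Karpathy test split. Below is a minimal sketch of local scoring with pycocoevalcap under that assumption; the image id and captions are made up for illustration, and real captions should first be lowercased and tokenized (e.g., with the PTB tokenizer).

```python
# Minimal CIDEr sketch with pycocoevalcap, assuming references are available
# (e.g., Karpathy test split). The id and captions below are hypothetical.
from pycocoevalcap.cider.cider import Cider

# References: image id -> list of ground-truth captions (already tokenized/lowercased).
gts = {
    391895: ["a man riding a motorcycle on a dirt road",
             "a person rides a motorbike through the countryside"],
}
# Candidates: image id -> list with exactly one generated caption.
res = {
    391895: ["a man rides a motorcycle down a dirt road"],
}

score, per_image_scores = Cider().compute_score(gts, res)
print(f"CIDEr: {score:.3f}")
```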
-
#### :bug: I need to change the MODE every time I watch a new movie on Amazon.
Program version 3.5.1
Closed captioning/subtitles are turned ON in Amazon Prime Video.
I will switch to Amazon Prime vide…
-
Hi,
Is it possible to use image captions for training?
Here you only mention the filename that will be used as the trigger word, but all the options for caption training have disappeared.
Could you implement it …
-
Dear @agermanidis,
I am Roberto Minelli, and I am part of the team that organizes [TEDxLakeComo](http://www.tedxlakecomo.com).
For the captioning process of [TED Talks](https://www.ted.com/talks)…
-
If you watch Tzviya's CEPC talk after clicking on "Sync video and hide transcript", you'll see no links at all - including no link to the document she's talking about. The links are in the longer tra…
-
Hi guys, I am trying to generate my own features.tsv and labels.tsv for my dataset, but I am stuck at the following:
1. I am slightly confused about what exactly these features are. Upon r…
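For what it's worth, here is a rough sketch of one common convention for features.tsv: each row holds an image id, the number of regions, and the detector's region features serialized as a base64-encoded float32 blob. The exact column layout (and whether boxes, class labels, etc. are stored alongside) varies between repositories, so treat this as an illustration rather than the format this repo necessarily expects.

```python
# Sketch of writing/reading region features in a base64 TSV convention.
# Column layout and feature dimension (2048) are assumptions for illustration.
import base64
import csv
import numpy as np

def encode_features(feats: np.ndarray) -> str:
    """Serialize a (num_boxes, feat_dim) float32 array to a base64 string."""
    return base64.b64encode(feats.astype(np.float32).tobytes()).decode("utf-8")

def decode_features(s: str, feat_dim: int) -> np.ndarray:
    """Inverse of encode_features; feat_dim must match what was written."""
    buf = base64.b64decode(s)
    return np.frombuffer(buf, dtype=np.float32).reshape(-1, feat_dim)

# Hypothetical detector output for one image: 10 region features of size 2048.
image_id = "000000391895"
features = np.random.rand(10, 2048).astype(np.float32)

with open("features.tsv", "w", newline="") as f:
    writer = csv.writer(f, delimiter="\t")
    writer.writerow([image_id, features.shape[0], encode_features(features)])
```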
-
Hi, thanks again for contributing such good work. I'm just wondering, have you released the prompts (i.e., instructions) for the various multi-modality tasks used in OFA-CN, especially the visual grounding task? Th…
-
**Is your feature request related to a problem? Please describe.**
I have been actively using this repository for multimodal training involving images and text. It has been incredibly helpful for my …
-
[Generating Visual Explanations](https://link.springer.com/chapter/10.1007/978-3-319-46493-0_1)
Clearly explaining a rationale for a classification decision to an end user can be as important as the …
-
Hi, I have checked the CLIP-Vision embedding (last hidden state) of BLIP-2 & InstructBLIP on Hugging Face (instructblip-vicuna-7b); the dimension is 257x1408. However, the multi-modal matching space of Vi…
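For reference, a minimal sketch (assuming the Hugging Face transformers implementation of InstructBLIP) of where the 257x1408 tensor comes from: the ViT vision tower emits 256 patch tokens plus one CLS token, each with hidden size 1408. The prompt text and image path below are placeholders.

```python
# Inspect the vision encoder's last hidden state for instructblip-vicuna-7b.
# Note: this downloads a large checkpoint; the image path is a placeholder.
import torch
from PIL import Image
from transformers import InstructBlipProcessor, InstructBlipForConditionalGeneration

processor = InstructBlipProcessor.from_pretrained("Salesforce/instructblip-vicuna-7b")
model = InstructBlipForConditionalGeneration.from_pretrained("Salesforce/instructblip-vicuna-7b")
model.eval()

image = Image.open("example.jpg").convert("RGB")
inputs = processor(images=image, text="Describe the image.", return_tensors="pt")

with torch.no_grad():
    vision_out = model.vision_model(pixel_values=inputs.pixel_values)

# Expected: torch.Size([1, 257, 1408]) -> 1 CLS token + 256 patch tokens, width 1408.
print(vision_out.last_hidden_state.shape)
```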