captioning Search Results

1000+ results
for captioning

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

salesforce/LAVIS #31

Adding dataset for VQA task

Hi, how can I add the visual7w dataset for the VQA task? The adding datasets documentation is for AVSD task and I'm not sure how to do similar steps but for a different task... My data has images, que…

AditiSharma97 updated 2 years ago
1
AkihikoWatanabe/paper_notes #1435

COM Kitchens: An Unedited Overhead-view Video Dataset as a …

# URL - https://arxiv.org/abs/2408.02272 # Affiliations - Koki Maeda, N/A - Tosho Hirasawa, N/A - Atsushi Hashimoto, N/A - Jun Harashima, N/A - Leszek Rybicki, N/A - Yusuke Fukasawa, N/A …

AkihikoWatanabe updated 1 month ago
1
fulfulggg/Information-gathering #188

見るか推測するか：反事実的に正則化された画像キャプション生成

## タイトル: 見るか推測するか：反事実的に正則化された画像キャプション生成 ## リンク: https://arxiv.org/abs/2408.16809 ## 概要: 画像の内容を自然言語で記述する画像キャプション生成は、視覚と言語の研究において重要なタスクです。従来のモデルは、既存のデータセットの統計的な適合を通じて、機械の生成能力を人間の知能に近づけることで、このタスクに取り組…

fulfulggg updated 2 months ago
2
HVision-NKU/StoryDiffusion #115

故事提示词中的 [NC] 和 # 分别是表示什么意思啊？

Echo411 updated 5 months ago
1
AUTOMATIC1111/stable-diffusion-webui #8725

[Feature Request]: add blip2 model to "Preprocess images".

### Is there an existing issue for this? - [X] I have searched the existing issues and checked the recent builds/commits ### What would your feature do ? It will use blip2 models for text desc of i…

goblin776655 updated 1 year ago
4
fengyang0317/unsupervised_captioning #26

error

from config import TF_MODELS_PATH ImportError: cannot import name 'TF_MODELS_PATH' from 'config' (D:\python3.7\lib\site-packages\config\__init__.py) what's the mean of TF_MODELS_PATH? I can't find…

feng-yun-anhui updated 3 years ago
15
pzzhang/VinVL #3

How to decode feature files?

Hello! Thanks for your wonderful work. May I know how to decode GQA pretrained feature files? Specifically, how to convert the base64 encoded features (data in features.tsv) to floating points? Thanks…

Zhonghao2016 updated 3 years ago
2
linjieyangsc/densecap #3

Unable to find train.txt, val.txt, test.txt in corresponding…

I want to reproduce your code and want to run it on Visual Genome1.4 dataset. However, I cannot find the corresponding TXT file for train, val, and test when loading the dataset. Can you put these thr…

ZhiqiangZ updated 5 years ago
1
snap-research/Panda-70M #50

The performance for video caption seems poor

Hello, I used the code and weights you provided to execute the inference.py file, but the results seem to be very different from what is shown. Do you know what is the reason for this please? ![ima…

Hyu-Zhang updated 6 months ago
1
shap/shap #3036

[Meta-issue] Notebooks are outdated / non-runnable

## Background We should also make sure that our documentation is kept up to date. A scour through the open issues in this repo and also on [StackOverflow](https://stackoverflow.com/questions/tag…

thatlittleboy updated 3 months ago
9

上一页 1...76 77 78 79 80 81 82...100 下一页

1000+ results for captioning

1000+ results
for captioning