visual-question-answering Search Results

1000+ results
for visual-question-answering

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

haotian-liu/LLaVA #831

Fine-tune a LLaVa model for Visual question answering task o…

### Question I want to fine-tune a LLaVa model for Visual question answering task on some custom set of images. I wanted to know the Dataset format required for training and then fine-tuning. I found…

anjanakg updated 11 months ago
6
huggingface/huggingface.js #174

Add inference demos

Add demos on https://huggingface.co/huggingfacejs (feel free to contribute demos, or to ask joining the organization) ### Natural Language processing - [ ] Fill mask - [ ] Summarization - [ ] …

coyotte508 updated 1 year ago
14
mlflow/mlflow #10860

[BUG] spark_udf() issue with custom transformers pipeline

### Issues Policy acknowledgement - [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md) ### Where…

kgoz12 updated 5 months ago
8
ikethepike/dungeons-and-dashboards #5

Establish basic visual style

We should discuss a basic visual language, which I think either goes full fantasy or a stripped back modern design: Questions that need answering: - [ ] Skinning down the road? Could there be say …

ikethepike updated 5 years ago
7
shikras/shikra #46

I have collected the download addresses for all the training…

I am reproducing the model on V100 GPU. If anyone is doing the same, I hope we can communicate and exchange ideas together. My wechat : Anymake_ren 1、Flickr 30k ： http://shannon.cs.illinois.edu/D…

Anymake updated 1 month ago
5
allegroai/clearml #1055

Reports: Ablation Study table

## Motivation Many research projects include an [Ablation Study](https://en.wikipedia.org/wiki/Ablation_(artificial_intelligence)) to compare model performance in the presence/absence of a combinatio…

InonS updated 2 months ago
4
NVIDIA/kvpress #23

Unknown task kv-press-text-generation

KeyError: "Unknown task kv-press-text-generation, available tasks are ['audio-classification', 'automatic-speech-recognition', 'depth-estimation', 'document-question-answering', 'feature-extraction', …

Dominic789654 updated 1 week ago
1
batmanlab/BatmanLabWiki #34

Paper stack from CVPR 2018- Sumedha

- [ ] [ Neural Baby Talk](http://openaccess.thecvf.com/content_cvpr_2018/papers/Lu_Neural_Baby_Talk_CVPR_2018_paper.pdf) Keywords: Image captioning predict template-like sentences Reference: [Hy…

sumedhasingla updated 6 years ago
2
alibaba/AliceMind #61

Pretrained weights for downstream tasks for mPLUG?

Currently, only the pretrained weights before fine-tuning on downstream tasks for mPLUG are released. Is it possible to release the pretrained weights for downstream tasks after fine-tuning, like visu…

qiaomu-miao updated 2 years ago
1
salesforce/LAVIS #309

what is the difference between the Instructed Zero-shot Imag…

In my understanding, VQA is similar with the ability of zero-shot image-to-text generation mentioned in the BLIP2 paper. They all give the answer about prompt(question / natural language instructions)…

gyula-coder updated 1 year ago
1

上一页 1...2 3 4 5 6 7 8...100 下一页

1000+ results for visual-question-answering

1000+ results
for visual-question-answering