visual-question-answering Search Results

1000+ results
for visual-question-answering

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Kardbord/hfapigo #14

Add support for undocumented tasks

The following tasks are available in the model hub, and seem to have inference support, but are not yet listed in the [Inference API docs](https://huggingface.co/docs/api-inference/detailed_parameters…

Kardbord updated 4 months ago
4
huggingface/transformers #34169

Image-Text-to-Text Support in Transformers Pipeline

### Feature request Implement the new feature to support a pipeline that can take both an image and text as inputs, and produce a text output. This would be particularly useful for multi-modal tasks …

chakravarthik27 updated 1 month ago
2
clovaai/donut #236

Difficulties finetuning for another language

Hi there! First of all, thank you so much for all of your work and the time put into answering everyone's questions in the Issues section! I've been trying to finetune Donut for French visual q…

lauraminkova updated 1 month ago
12
zhangbin-ai/APL #2

Extract visual features and bounding box features

Hello, I thoroughly enjoyed reading your paper, "Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering." I am writing to ask about the code provided for the paper. I am tryi…

nanacoco419 updated 2 months ago
1
dotnet/machinelearning-modelbuilder #2830

Object Detection-Azure: The error message is truncated on th…

**System Information (please complete the following information):** Windows OS: Windows-11-Enterprise-22H2 ML.Net Model Builder 2022: 17.17.17.2407507 (Main Build) Microsoft Visual Studio Enterpris…

v-Hailishi updated 2 weeks ago
1
dotnet/machinelearning-modelbuilder #2800

Question Answering: The error message is not very clear afte…

**System Information (please complete the following information):** Windows OS: Windows-11-Enterprise-22H2 ML.Net Model Builder 2022: 17.17.0.2360601 (Main Build) Microsoft Visual Studio Enterprise…

v-Hailishi updated 1 week ago
4
xinwei666/MMGenerativeIR #1

OKVQA - GS112K是怎么通过OKVQA得到的

MathamPollard updated 1 month ago
1
kadirnar/ComfyUI-Transformers #12

ROADMAP of ComfyUI-Transformers

## Computer Vision: - [x] Add Depth Estimation pipeline - [ ] Add Image Classification pipeline - [ ] Add Image Segmentation pipeline - [ ] Add Mask Generation pipeline - [ ] Add Object Detecti…

kadirnar updated 5 months ago
1
SatyamGaba/visual_question_answering #1

Error IsDirectoryError when Preproccess input data

HI @SatyamGaba I do all step by step but when running python3 make_vacabs_for_questions_answers.py --input_dir='../COCO-2015' I see some error could you help me please ubuntu@ubuntu-VirtualBox…

kobrafarshidi updated 2 years ago
2
ggerganov/llama.cpp #9246

Feature Request: Support for Qwen2-VL

### Prerequisites - [X] I am running the latest code. Mention the version if possible as well. - [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md)…

isr431 updated 9 hours ago
76

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for visual-question-answering

1000+ results
for visual-question-answering