-
### Question
I want to fine-tune a LLaVa model for Visual question answering task on some custom set of images. I wanted to know the Dataset format required for training and then fine-tuning. I found…
-
Add demos on https://huggingface.co/huggingfacejs (feel free to contribute demos, or to ask joining the organization)
### Natural Language processing
- [ ] Fill mask
- [ ] Summarization
- [ ] …
-
### Issues Policy acknowledgement
- [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md)
### Where…
-
We should discuss a basic visual language, which I think either goes full fantasy or a stripped back modern design:
Questions that need answering:
- [ ] Skinning down the road? Could there be say …
-
I am reproducing the model on V100 GPU. If anyone is doing the same, I hope we can communicate and exchange ideas together. My wechat : Anymake_ren
1、Flickr 30k :
http://shannon.cs.illinois.edu/D…
-
## Motivation
Many research projects include an [Ablation Study](https://en.wikipedia.org/wiki/Ablation_(artificial_intelligence)) to compare model performance in the presence/absence of a combinatio…
InonS updated
2 months ago
-
KeyError: "Unknown task kv-press-text-generation, available tasks are ['audio-classification', 'automatic-speech-recognition', 'depth-estimation', 'document-question-answering', 'feature-extraction', …
-
- [ ] [ Neural Baby Talk](http://openaccess.thecvf.com/content_cvpr_2018/papers/Lu_Neural_Baby_Talk_CVPR_2018_paper.pdf)
Keywords:
Image captioning
predict template-like sentences
Reference: [Hy…
-
Currently, only the pretrained weights before fine-tuning on downstream tasks for mPLUG are released. Is it possible to release the pretrained weights for downstream tasks after fine-tuning, like visu…
-
In my understanding, VQA is similar with the ability of zero-shot image-to-text generation mentioned in the BLIP2 paper. They all give the answer about prompt(question / natural language instructions)…