-
The following tasks are available in the model hub, and seem to have inference support, but are not yet listed in the [Inference API docs](https://huggingface.co/docs/api-inference/detailed_parameters…
-
### Feature request
Implement a new pipeline that takes both an image and text as inputs and produces a text output. This would be particularly useful for multi-modal tasks …
-
Hi there!
First of all, thank you so much for all of your work and the time put into answering everyone's questions in the Issues section!
I've been trying to finetune Donut for French visual q…
-
Hello, I thoroughly enjoyed reading your paper, "Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering."
I am writing to ask about the code provided for the paper. I am tryi…
-
**System Information (please complete the following information):**
Windows OS: Windows-11-Enterprise-22H2
ML.Net Model Builder 2022: 17.17.17.2407507 (Main Build)
Microsoft Visual Studio Enterpris…
-
**System Information (please complete the following information):**
Windows OS: Windows-11-Enterprise-22H2
ML.Net Model Builder 2022: 17.17.0.2360601 (Main Build)
Microsoft Visual Studio Enterprise…
-
## Computer Vision:
- [x] Add Depth Estimation pipeline
- [ ] Add Image Classification pipeline
- [ ] Add Image Segmentation pipeline
- [ ] Add Mask Generation pipeline
- [ ] Add Object Detecti…
-
Hi @SatyamGaba,
I followed all the steps one by one, but
when running python3 make_vacabs_for_questions_answers.py --input_dir='../COCO-2015'
I get an error. Could you help me, please?
ubuntu@ubuntu-VirtualBox…
-
### Prerequisites
- [X] I am running the latest code. Mention the version if possible as well.
- [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md)…