-
The provided datasets come in four variants, each serving a specific purpose, and contain a `text_description` as described below. E.g., gov:
1. **syntheticDocQA_government_reports_test** – **No text_des…
-
Hello! Could you please add the SALMONN series models?
Title | Venue | Date | Code | Demo
-- | -- | -- | -- | --
[SALMONN: Towards Generic Hearing Abilities for Large Language Models](https://arxiv.o…
-
Thank you for your meaningful work.
I would like to ask how events are defined in the video data. In other words, how is a video segmented into multi-event segments? Thanks.
-
Great work!
How can I perform the task of generating video captions?
-
**Describe the problem to be solved**
PeerTube's Live system is a really powerful and useful feature. PeerTube's subtitle support for stored videos (and the ability to add them after the fact) is…
-
### Feature request
Add support for a pipeline that takes both an image and text as inputs and produces a text output. This would be particularly useful for multi-modal tasks …
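
To make the request concrete, here is a rough sketch of what an image-plus-text-to-text interface could look like. Every name in it (`ImageTextToTextPipeline`, `dummy_model`) is hypothetical and is not an existing library API; the model is a stand-in callable, so this only illustrates the input/output shape being asked for.

```python
from typing import Callable


class ImageTextToTextPipeline:
    """Hypothetical wrapper around any callable mapping (image, text) -> text."""

    def __init__(self, model: Callable[[bytes, str], str]) -> None:
        self.model = model

    def __call__(self, image: bytes, text: str) -> str:
        # Both modalities are required for this task.
        if not image:
            raise ValueError("image must not be empty")
        if not text:
            raise ValueError("text must not be empty")
        return self.model(image, text)


def dummy_model(image: bytes, text: str) -> str:
    # Stand-in for a real vision-language model (assumption for illustration).
    return f"[{len(image)} image bytes] answer to: {text}"


pipe = ImageTextToTextPipeline(dummy_model)
result = pipe(b"\x89PNG", "What is shown?")
print(result)
```

A real implementation would decode the image and run an actual multi-modal model; the point here is only that one call accepts both inputs and returns a string.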
-
I'm trying to test the network on my Windows 10 notebook. I configured all the packages, but when the test starts it gives me the following error:
Traceback (most recent call last):
File "", line 1, in…
-
### System Info
- `transformers` version: 4.47.0.dev0
- Platform: Linux-5.15.0-94-generic-x86_64-with-glibc2.35
- Python version: 3.10.15
- Huggingface_hub version: 0.26.2
- Safetensors version: …
-
Hello,
I have successfully generated all features (both text and visual) for the COCO dataset. However, when running MLE training, the code throws the following error at the moment it starts valida…
-
## Project Request
Image Captioning with Deep Learning
This project aims to develop a model that automatically generates descriptive captions for images.
---
| Field | Description …