-
Good Morning. How to visualize the predicted caption?
-
VTT
-
### Describe the request
I want MP4s to be importable alongside GIF and PNG/JPG.
### Is your feature request related to a problem? Please describe.
Since discord converts GIFs to MP4s all the time,…
-
# Music Caption Generation
Creating natural language descriptions for clips of music
## Task Objective
The primary goal of the Music Caption Generation task is to create concise, natural lang…
-
# Task Name: Audio Caption Generation
Generating natural language description for any kind of audio in the wild.
## Task Objective
The task of Audio Caption Generation involves a model receivin…
-
Great work to start with !
I am sure you will upgrade this marvel frequently
To start with its a great UI. just need few tweaks.
1stly, I noticed that even after caption generation, offloading do…
-
Dear Author,
I hope this message finds you well. I've been working with the model. The provided [README](https://github.com/acharkq/MolCA/blob/main/README.md) includes a script for fine-tuning spec…
-
In Case That the GPUs our idle, we can always use them for generating high-quality captions for finetuning Image Generation Models later.
I am working on a docker file together with Andreas That con…
-
Hi,
Thanks a lot for releasing codes for such a great work. Will you release text generation or text filter codes for convert TCGA captions into question-answer pairs? It would be a large help for…
-
After training, when I use the model to generate captions. It starts giving me the below error:
`File "caption.py", line 215, in
seq, alphas = caption_image_beam_search(encoder, decoder, args.i…