-
Hello,
Glad to see so much active work in this field, and that you could use our recently released ThermoScenes dataset!
We would appreciate it if you could properly reference/cite the source of the …
-
I think a common use case is to toggle between voice and text mode (like in the ChatGPT app among others).
If the goal is to create a multimodal framework that can easily toggle between modalities,…
-
self.nummasks, self.masklen = self.masks.shape
ValueError: not enough values to unpack (expected 2, got 0)
I hope to be able to get help. Thank you very much.
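
For anyone hitting the same message: `expected 2, got 0` means the shape being unpacked is an empty tuple, which is what a 0-dimensional array reports (for example, if the masks were never loaded). The sketch below is an assumption about the cause, not a diagnosis of this repository; `unpack_mask_shape` is a hypothetical helper that only illustrates a clearer check before unpacking.

```python
def unpack_mask_shape(shape):
    """Split a 2-D shape into (num_masks, mask_len) with a clearer error.

    Hypothetical helper: `shape` stands in for `self.masks.shape`.
    """
    if len(shape) != 2:
        raise ValueError(f"expected a 2-D mask array, got shape {tuple(shape)}")
    num_masks, mask_len = shape
    return num_masks, mask_len


# A 0-dimensional array reports shape (), which reproduces the original message.
try:
    num_masks, mask_len = ()
except ValueError as exc:
    reproduced = str(exc)

# A well-formed 2-D shape unpacks as expected.
ok = unpack_mask_shape((3, 5))
```

If the check fails, printing the shape and type of the masks right after they are loaded is usually the quickest way to see where they went missing.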
-
Hi!
I have another question: once a model is trained, say using 'T1' and 'T2' as inputs and 'T2FLAIR' as output, how can I extract the partial model with only 'T1' as input and 'T2FLAIR' as output?
…
-
**Submitting author:** @souzatharsis (Thársis T. P. Souza)
**Repository:** https://github.com/souzatharsis/podcastfy
**Branch with paper.md** (empty if default branch):
**Version:** v0.2.17
**Editor:…
-
### Feature request / 功能建议
Hi, is it possible to use the image embedding separately to do image retrieval based on a query?
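
To make the request concrete, here is a minimal sketch of what retrieval over image embeddings could look like, assuming the model can return one vector per image and one vector for the query in the same space. The embeddings, file names, and the `retrieve` function are made up for illustration and are not part of this project's API.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query_embedding, image_embeddings, k=3):
    """Return the k (name, score) pairs most similar to the query, best first."""
    scored = [
        (name, cosine_similarity(query_embedding, emb))
        for name, emb in image_embeddings.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy 3-dimensional embeddings (assumed values, not real model output).
index = {
    "cat.jpg": [0.9, 0.1, 0.0],
    "dog.jpg": [0.1, 0.9, 0.0],
    "car.jpg": [0.0, 0.1, 0.9],
}
results = retrieve([1.0, 0.0, 0.0], index, k=2)
```

For a RAG setup, the retrieved image names would then be passed to the generation step alongside the text query.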
### Motivation / 动机
Want to do RAG on images.
### Your contribution / …
-
Hi,
Thanks for your paper and code for "Three Steps to Multimodal Trajectory Prediction: Modality Clustering, Classification and Synthesis". It's an amazing job.
I want to ask a question about val_…
-
## Introduction
We can envision and consider client-side, server-side and third-party speech recognition, synthesis and translation scenarios for a next version of the Web Speech API.
## Advanci…
-
Thanks to the multimodal brain synthesis code you kindly uploaded, I was able to attempt creating a FLAIR image from multi-contrast images. My input contrasts are [T1, T2, PD], and the output i…
-
# Few-Shot Unsupervised Image-to-Image Translation #
- Author: Ming-Yu Liu, Xun Huang, Arun Mallya, Tero Karras, Timo Aila, Jaakko Lehtinen, Jan Kautz
- Origin: https://arxiv.org/abs/1905.01723
-…