multimodal-synthesis Search Results

mzzcdf/Thermal3DGS #3

ThermoScenes dataset and paper referencing

Hello, Glad to see so much active work in this field, and that you could use our recently released ThermoScenes dataset! We would appreciate if you can properly reference/cite the source of the …

FlorentF9 updated 1 month ago

livekit/agents #791

Aggressive transcript mode / text response only mode

I think a common use case is to toggle between voice and text mode (like in the ChatGPT app among others). If the goal is to create a multimodal framework that can easily toggle between modalities,…

willsmanley updated 1 month ago

SeeMeInCrown/CoLa_Diff_MultiModal_MRI_Synthesis #4

not enough values to unpack (expected 2, got 0)

self.nummasks, self.masklen = self.masks.shape ValueError: not enough values to unpack (expected 2, got 0) I hope to be able to get help. Thank you very much.

xin-xin-zy updated 6 days ago

agis85/multimodal_brain_synthesis #3

How to load a partial model after training is done?

Hi! I have another questions: Once a model is trained, say using 'T1', 'T2' as inputs, and 'T2FLAIR' as output, how can I extract the partial model with only 'T1' as input and 'T2FLAIR' as output? …

trane293 updated 5 years ago

openjournals/joss-reviews #7432

[PRE REVIEW]: When Content Speaks Volumes: Podcastfy — An Op…

**Submitting author:** @souzatharsis (Thársis T. P. Souza) **Repository:** https://github.com/souzatharsis/podcastfy **Branch with paper.md** (empty if default branch): **Version:** v0.2.17 **Editor:…

editorialbot updated 3 days ago

THUDM/CogVLM2 #5

Use learned image-text embedding

### Feature request / 功能建议 Hi, is it possible to use the image embedding seperately to do image retrieval based on a query? ### Motivation / 动机 Want to do RAG on images. ### Your contribution / …

nivibilla updated 5 months ago

ApeironY/Modular-Trajectory-Prediction-Toolkit #2

Question about test_dataset and val_dataset

Hi, Thanks for your paper and code for "Three Steps to Multimodal Trajectory Prediction: Modality Clustering, Classification and Synthesis". It's an amazing job. I want ask a question about val_…

YonghaoDong updated 2 years ago

WICG/speech-api #41

Client-side, Server-side and Third-party Speech Recognition,…

## Introduction We can envision and consider client-side, server-side and third-party speech recognition, synthesis and translation scenarios for a next version of the Web Speech API. ## Advanci…

AdamSobieski updated 6 years ago

agis85/multimodal_brain_synthesis #5

Which is the real result?

Thanks to your kindly uploaded multimodal brain synthesis code, I was able to make an attempt to create a FLAIR image from a multi contrast image. My input contrasts are [T1, T2, PD], and the output i…

kapmen updated 5 years ago

number9473/nn-algorithm #308

Few-Shot Unsupervised Image-to-Image Translation

# Few-Shot Unsupervised Image-to-Image Translation # - Author: Ming-Yu Liu, Xun Huang, Arun Mallya, Tero Karras, Timo Aila, Jaakko Lehtinen, Jan Kautz - Origin: https://arxiv.org/abs/1905.01723 -…

joyhuang9473 updated 5 years ago

105 results for multimodal-synthesis

105 results
for multimodal-synthesis