Hi,
I am struggling to find any support for developing a model for a multi-model dataset .
Can you please guide or give me some reference for using this dataset and developing a LSTM or CNN model.
I am new to this field but for now i am able to develop model for images and text separately but having trouble in using a merged input (image+text or image+audio ).
Please provide some direction or explain with respect to the baseline model you provided.
Thanks
Hi, I am struggling to find any support for developing a model for a multi-model dataset . Can you please guide or give me some reference for using this dataset and developing a LSTM or CNN model. I am new to this field but for now i am able to develop model for images and text separately but having trouble in using a merged input (image+text or image+audio ).
Please provide some direction or explain with respect to the baseline model you provided. Thanks