Open VSK02101996 opened 2 years ago
Our code is based on MUlT
and supports CMU-MOSEI, CMU-MOSI and IEMOCAP. The preprocessed data can be obtained from the MUlT. If you want to use custom dataset, you should modify dataset.py and adjust the hyperparameters like hidden dim.
For custom data I need to preprocess using https://github.com/yaohungt/Multimodal-Transformer this right ?
For custom data I need to preprocess using https://github.com/yaohungt/Multimodal-Transformer this right ?
The benchmark datasets like MOSEI, MOSEI, and IEMOCAP can be downloaded directly in https://github.com/yaohungt/Multimodal-Transformer. For custom datasets, you should preprocess data following its standard practice or by yourself. This may require large modification and thus we recommend just referring to the code of our model and merging the model to your custom data pipeline. The default hyperparameter for the network can be referred in main.py.
What is meaning of standard practice like calculating mfcc , chroma etc or something else ? Also merging means creating model for custom data and merge that model with your model ; is it right ? @Trert111
Can you share the code how to do preprocessing of custom data ? @Trert111 . Atleast share how it is done on iemocap,mosi dataset ?
For MOSEI and MOSI, you can refer to CMU-MultimodalSDK. For IEMOCAP, you can refer to other related repositories. Our data is from here provided by MULT.
I don't understand this lines can you please elaborate ;
For custom datasets, you should preprocess data following its standard practice or by yourself. This may require large modification and thus we recommend just referring to the code of our model and merging the model to your custom data pipeline.
Since our code now only supports dataset of MOSEI, MOSEI, and IEMOCAP, the custom dataset may require rewriting a large portion of code, including dataset, dataloader, hidden dim, and even the training&testing procedures. Therefore, you can write your own code for your dataset and refer to what you need in our code. The core model is in modules.
How it will work for some kind of reviews or interviews (real world) ?