thuiar / MMSA

MMSA is a unified framework for Multimodal Sentiment Analysis.

About the released Processed Data #48

Closed RH-Lin closed 2 years ago

RH-Lin commented 2 years ago

Thank you for your wonderful work! I have some questions about the released dataset on the BaiduYun Disk: the Processed Data for MOSEI has 74 and 35 feature dimensions for the audio and vision modalities, which suggests these two modalities were extracted with COVAREP and Facet. However, the Processed Data for MOSI has 5 and 20 feature dimensions for the audio and vision modalities. What feature extractor did you use for the MOSI data? As far as I know, MOSI audio and vision features extracted by COVAREP and Facet have 74 and 47 dimensions, respectively. Thank you for your answer!
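(For anyone who wants to double-check these numbers themselves, a minimal sketch along the following lines should work, assuming the downloaded Processed Data follows MMSA's usual pickle layout with split keys and per-modality arrays; the file path here is illustrative.)

```python
import pickle

import numpy as np

# Hypothetical path: point this at the downloaded Processed Data file.
DATA_PATH = "MOSI/Processed/unaligned_50.pkl"  # assumed file name

# Assumption: the pickle is keyed by split ("train"/"valid"/"test"),
# and each split holds "audio" and "vision" feature arrays.
with open(DATA_PATH, "rb") as f:
    data = pickle.load(f)

for split in ("train", "valid", "test"):
    audio = np.asarray(data[split]["audio"])    # (num_samples, seq_len, audio_dim)
    vision = np.asarray(data[split]["vision"])  # (num_samples, seq_len, vision_dim)
    print(f"{split}: audio dim = {audio.shape[-1]}, vision dim = {vision.shape[-1]}")
```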

Columbine21 commented 2 years ago

Hi @GHBigHandsome, the provided audio and visual features are the original ones extracted by the CMU MultiComp team (http://immortal.multicomp.cs.cmu.edu/raw_datasets/processed_data/).

As stated in the introduction of our ACL 2022 demo paper (https://arxiv.org/pdf/2203.12441.pdf), the features cannot be easily reproduced because of the manual feature selection involved (see http://immortal.multicomp.cs.cmu.edu/raw_datasets/processed_data/cmu-mosi/readme.MD).