Closed hugddygff closed 5 years ago
The link in the README should point to the visual and acoustic features averaged across the temporal dimension. For the original video data, you can download them here. For more info on these datasets, you can check out CMU-MultimodalSDK.
sorry, I have made a spelling mistake in the question. I mean the original features without averaging along the temporal dimension. Instead of with, thanks!
For that, you could also refer to the Multimodal SDK mentioned above.
Hi, Justin, Can you share the original face features and audio features with averaging along the temporal dimension? and where can I download the original video data of the three datasets?
Thanks very much!