missing speech features for LSMDC dataset

gabeur / mmt

Multi-Modal Transformer for Video Retrieval

Apache License 2.0

259 stars 41 forks source link

Hi, thanks for sharing the code.

I noticed that the speech features are missing for all video clips in the LSMDC.tar.gz. But the paper mentioned that

I watched some original video clips from LSMDC dataset and found that they are all with audio where speech transcripts can be extracted using the Google Cloud Speech to Text API.

Therefore, my question is that did you train the MMT model with speech features on LSMDC dataset. If so, what's the result and would you please share the speech feature files? If not, why did't you utilize the speech features?

I would appreciate your reply.

gabeur / mmt

missing speech features for LSMDC dataset #13