Videos are not well-aligned with the texts.

Obviously this issue was already brought up at https://github.com/declare-lab/MELD/issues/9

The alignment is pretty bad. It's hard for me to go multimodal at the moment, because of this issue.

I have two questions:

Has this been fixed? Or are you planning on using a better alignment tool?
Can I have access to the original friends videos? I wonder if I can cut the videos into utterances myself using ASR.

declare-lab / MELD