guxm2021 / MM_ALT

[MM 2022 Oral] MM-ALT: A Multimodal Automatic Lyric Transcription System
Apache License 2.0

Missing key "label" in main.py #1

Open xiaoxue1117 opened 1 year ago

xiaoxue1117 commented 1 year ago

Cannot access the key "label" at line 334 of main.py in the ImuVAD directory.

Sonata165 commented 1 year ago

Thanks for following our work! The data for VAD training of the IMU encoder is separate from the ALT training data, and its format is a bit different. Let me add the data to Zenodo now ... Before that, you can use a checkpoint of the VAD model instead, so there is no need to train it yourself. I'll upload it to the ImuVAD folder.

Sonata165 commented 1 year ago

(The VAD data is in 5-second segment format instead of utterance-level, with some additional columns.)
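
Until the VAD data is available, a quick check along these lines may help confirm whether a given manifest carries the column that main.py expects. This is only a rough sketch: the CSV layout, every column name other than "label", and the helper `missing_keys` are assumptions for illustration, not the repository's actual data format.

```python
import csv
import io

# "label" is the key reported missing in this issue; any other
# required columns are unknown, so only this one is checked.
REQUIRED_KEYS = {"label"}


def missing_keys(csv_text, required=REQUIRED_KEYS):
    """Return the required column names that are absent from the CSV header."""
    reader = csv.DictReader(io.StringIO(csv_text))
    header = set(reader.fieldnames or [])
    return sorted(set(required) - header)


# Hypothetical utterance-level manifest (ALT-style): no "label" column.
alt_manifest = "ID,duration,wav\nutt1,7.2,utt1.wav\n"

# Hypothetical 5-second-segment manifest (VAD-style) with a "label" column.
vad_manifest = "ID,start,end,label\nseg1,0.0,5.0,1\n"

print(missing_keys(alt_manifest))  # columns the ALT-style file lacks
print(missing_keys(vad_manifest))  # empty list when nothing is missing
```

If the first call reports "label" as missing, the file is probably an ALT manifest rather than a VAD one, which would be consistent with the error described above.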

xiaoxue1117 commented 1 year ago

Thank you for your reply! If you can also update the fusion module for multi-modal training, that would be great!