which have a different format from the audio file names in the raw audio folders for each language. For example, the folder audios/lt/ contains:
ls | head -n 5
20090112-0900-PLENARY-10_lt_1079616_1086270.ogg
20090112-0900-PLENARY-10_lt_1133568_1136670.ogg
20090112-0900-PLENARY-10_lt_1238304_1242270.ogg
20090112-0900-PLENARY-10_lt_1288704_1292862.ogg
20090112-0900-PLENARY-10_lt_1288704_1296606.ogg
So how do these two formats align with each other? I thought the number pairs would somehow be the same, but they actually are not.
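To make the comparison concrete, here is a minimal sketch that splits the audio file names into fields so the number pairs can be checked against the TSV side. It assumes the last two underscore-separated fields of each name are the number pair; what they measure (samples, milliseconds, or something else) is not confirmed here. `AudioName` and `parse_audio_name` are hypothetical helper names, not part of any SpeechMatrix tooling.

```python
from pathlib import Path
from typing import NamedTuple


class AudioName(NamedTuple):
    session: str  # e.g. "20090112-0900-PLENARY-10"
    lang: str     # e.g. "lt"
    first: int    # first number of the pair (meaning/unit is an assumption)
    second: int   # second number of the pair


def parse_audio_name(name: str) -> AudioName:
    """Split '<session>_<lang>_<first>_<second>.ogg' into its fields."""
    stem = Path(name).stem  # drop the ".ogg" extension
    # Split from the right so any underscores in the session ID are preserved.
    session, lang, first, second = stem.rsplit("_", 3)
    return AudioName(session, lang, int(first), int(second))


names = [
    "20090112-0900-PLENARY-10_lt_1079616_1086270.ogg",
    "20090112-0900-PLENARY-10_lt_1133568_1136670.ogg",
]
for n in names:
    p = parse_audio_name(n)
    # Print the pair and its span for comparison with the TSV entries.
    print(p.session, p.lang, p.first, p.second, p.second - p.first)
```

Printing the span (second minus first) next to each pair may help show whether the two formats differ only by a unit conversion or by something else.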
For context, this comes up during data preparation for SpeechMatrix, when working with the aligned_speech TSV files.
Could anybody help? Thank you so much!