Open laleye opened 2 years ago
Hi, maybe your data are not in the correct format?
The input to the model has to be single-channel and sampled at 16 kHz. You can convert your files with the following command:
ls ${path_to_wavs}/*.* | parallel -j 4 ffmpeg -i {} -ac 1 -ar 16000 -hide_banner -loglevel error {.}.wav
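To check whether the files already meet these requirements, something like the following rough sketch could help. It reads only the WAV headers using Python's standard `wave` module and reports files that are not mono or not 16 kHz. The function names `check_wav` and `find_bad_wavs` are made up for illustration and are not part of the repository. Note that `wave` only handles PCM-encoded WAV files, so other encodings are reported as unreadable here.

```python
import wave
from pathlib import Path


def check_wav(path, expected_rate=16000, expected_channels=1):
    """Return a list of problems found in the WAV header (empty if none)."""
    problems = []
    try:
        with wave.open(str(path), "rb") as wav:
            if wav.getnchannels() != expected_channels:
                problems.append(f"channels={wav.getnchannels()}")
            if wav.getframerate() != expected_rate:
                problems.append(f"sample_rate={wav.getframerate()}")
    except (wave.Error, EOFError) as err:
        # The wave module cannot parse non-PCM or truncated files.
        problems.append(f"unreadable: {err}")
    return problems


def find_bad_wavs(directory):
    """Map file name -> list of problems for every *.wav file in directory."""
    bad = {}
    for path in sorted(Path(directory).glob("*.wav")):
        problems = check_wav(path)
        if problems:
            bad[path.name] = problems
    return bad
```

Calling `find_bad_wavs("path/to/wavs")` (the path is a placeholder) returns an empty dictionary when every file is single-channel and sampled at 16 kHz. If it does, the input format is probably not the cause of the error.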
Thanks for your reply. All the data was already in this format. I converted it again anyway, but without success: I still get the same error.
Could you maybe try with a standard dataset like MuST-C, to see whether the problem is in the data?
@johntsi I will try it and let you know.
I'm trying to reuse your interesting code for speech translation on my own data. I get the following size error with the lna_ed configuration:
Do you know what I'm doing wrong?