I attempted to use Hugging Face's code to decode with the Whisper-Large model for SEAME. In comparison to the decoding results for Whisper-Default mentioned in Table 4 of the paper,
there seems to be a discrepancy between our results, possibly due to differences in language identification. Your results are devman: MER 51.55 and devsg: MER 61.36, while my results are devman: MER 38.2 and devsg: MER 65.0. Were these results also obtained using greedy search for decoding?
Hi,
I attempted to use Hugging Face's code to decode with the Whisper-Large model for SEAME. In comparison to the decoding results for Whisper-Default mentioned in Table 4 of the paper,
there seems to be a discrepancy between our results, possibly due to differences in language identification. Your results are devman: MER 51.55 and devsg: MER 61.36, while my results are devman: MER 38.2 and devsg: MER 65.0. Were these results also obtained using greedy search for decoding?