Open Rambo06 opened 3 years ago
This issue has been automatically marked as stale. If this issue is still affecting you, please leave any comment (for example, "bump"), and we'll keep it open. We are sorry that we haven't been able to prioritize it yet. If you have any new additional information, please include it with your comment!
❓ Questions and Help
Hello, I used XLSR-53 for topic classification on the Chinese dataset and the accuracy rate can reach 80%. But when I use 1200 hours of Chinese data for self-supervised training after 50 epochs. The accuracy becomes 50%.
I would like to ask, how many hours of data does self-training XLSR-53 need? How many epochs need to be trained ? Did I do something wrong?
I use the following code: https://github.com/mailong25/self-supervised-speech-recognition
python3 pretrain.py --fairseq_path path/to/libs/fairseq --audio_path path/to/audio_directory --init_model path/to/xlsr_53_56k.pt
Thanks for your answer!