Topic classification about XLSR-53，did I do something wrong?

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

MIT License

30.27k stars 6.39k forks source link

❓ Questions and Help

Hello， I used XLSR-53 for topic classification on the Chinese dataset and the accuracy rate can reach 80%. But when I use 1200 hours of Chinese data for self-supervised training after 50 epochs. The accuracy becomes 50%.

I would like to ask, how many hours of data does self-training XLSR-53 need? How many epochs need to be trained ？ Did I do something wrong?

I use the following code： https://github.com/mailong25/self-supervised-speech-recognition python3 pretrain.py --fairseq_path path/to/libs/fairseq --audio_path path/to/audio_directory --init_model path/to/xlsr_53_56k.pt

Thanks for your answer!

facebookresearch / fairseq

Topic classification about XLSR-53，did I do something wrong? #3406

❓ Questions and Help