facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
MIT License
29.81k stars 6.31k forks source link

strange unstable wav2vec2 pretraining #4948

Open Mohammadtvk opened 1 year ago

Mohammadtvk commented 1 year ago

❓ Questions and Help

I am trying to pretrain wav2vec2 on persian language using common voice dataset. I did not modify anything but the dataset path in configs. here is plots of training metrics

image image image image

as can be seen in above figures, after few epochs grad norm and accuracy drops, code perpelexity spike, and loss increases and then stays almost flat. why is that ?

What's your environment?

tarudesu commented 1 year ago

@Mohammadtvk Hi there, I have the same problem with you. Could I know that how can you fix that one, please?

StudentofSong commented 10 months ago

Hi there, I have the same problem with you. Have you solved your problem.Could I know that how can you fix that one, please?