facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
MIT License
30.59k stars 6.41k forks source link

Pretraining Hubert #5512

Closed shirounanashi closed 5 months ago

shirounanashi commented 5 months ago

What is your question?

I'm trying to train a Hubert model, and I'm having the problem that the .km files are coming out in ANSI instead of UTF-8. I'm using colab to create these files

Code

https://colab.research.google.com/drive/1Ju6sLIoHhIl5LEtyaWMuDuryDt7qbGJs?usp=sharing

What have you tried?

Training a hubert model

What's your environment?

shirounanashi commented 5 months ago

Forget it, I didn't run part of the code that was necessary to run