insunhwang89 / StyleVC

MIT License
30 stars 3 forks source link

About the requirement of dataset #11

Open Superman-Valencia opened 1 year ago

Superman-Valencia commented 1 year ago

In metadata.csv, there are paths, transcripts and speaker_labels. So, Do the training datasets have to have corresponding text? If l remove the text in the code, can the model work normally?

insunhwang89 commented 1 year ago

You don't need to have any text information. Used internally to verify that wav2vec is working properly.