Open Superman-Valencia opened 1 year ago
In metadata.csv, there are paths, transcripts and speaker_labels. So, Do the training datasets have to have corresponding text? If l remove the text in the code, can the model work normally?
You don't need to have any text information. Used internally to verify that wav2vec is working properly.
In metadata.csv, there are paths, transcripts and speaker_labels. So, Do the training datasets have to have corresponding text? If l remove the text in the code, can the model work normally?