microsoft / SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
MIT License
1.09k stars 113 forks source link

reproduction steps for inference #39

Open ghost opened 1 year ago

ghost commented 1 year ago

are all the required preprocessing steps:

  1. acquire dataset, checkpoint, source
  2. train spm
  3. hubert feature extraction
  4. run fairseq

or are there any missing parts?

ghost commented 1 year ago

Also, it's a bit opaque what we're meant to do with hubert, precisely

Ajyy commented 1 year ago

Could you give more details about inference, such as what tasks you are doing, what do you want to do?