openspeech-team / openspeech

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
https://openspeech-team.github.io/openspeech/
MIT License
670 stars 112 forks source link

A way to execute hydra_train.py for Ksponspeech #191

Open taejin0128 opened 1 year ago

taejin0128 commented 1 year ago

❓ Questions & Help

  1. Is it possible to make a STT model for converting bilingual speech to two different languages? For example, converting human speech(voice) to text '현재 비행기가 Turbulence로 인해 흔들리고 있습니다.)

    1-1) If it is possible, how can I train the model?

  2. If I want to train a Korean language data for a specific field, which one is a common method to train additional data on a pre-trained model? Method 1. step 1) Train the Ksponspeech data & Make a model step 2) Train a Korean language data of a specific field with the model generated in step 1

    Method 2. (I think that it needs too much efforts and time to transform the data for training together) step 1) Make a combined dictionary for Ksponspeech data and the other Korean language data step 2) Train all the data(Ksponspeech & other Korean langauge data) together

  3. For improving terminology recognition for a specific field, is it adequate to train addition data on a pre-trained model (which is trained with Ksponspeech data and "character" mode") with the "subword" option?

Details