Closed leohuang2013 closed 2 years ago
Hi Liyi, Yes it should be. Although I haven't checked the Kaldi recipe for quite a while, there are some facts that might be helpful to you as I recalled:
utils/download_gigaspeech.sh was once named as utils/gigaspeech_download.sh, they are the same thing. Kaldi's recipe might be out-of-synced for this renaming.
If the dictionary is indeed needed(this is typical case in Kaldi's hybrid systems), you can feed the --with-dict
option to get it.
Thanks Jiayu for your quick reply. Should we take action for this issue, like create pull request or something else to facilitate trying gigaspeech by others.
Yes I believe creating a PR to fix the name inconsistency in Kaldi's gigaspeech recipe, will definitely help other users and save their time, that would be great!
local/gigaspeech_data_prep.sh calls utils/gigaspeech_download.sh, which does not exist in Gigaspeech Repo. It should be utils/download_gigaspeech.sh $gigaspeech
and in run.sh
it checks G2P model, which is supposed to be downloaded in utils/download_gigaspeech.sh when flag --with-dict is provided as 'true', by default it is 'false', hence it won't download G2P model. To solve this problem, need pass parameter '--with-dict true' in invoking utils/download_gigaspeech.sh.
Final modification for downloading gigaspeech would be
Is above right, or did I miss something?