Open dietercastel opened 3 years ago
Hi Dieter,
My apologies for not replying faster, for some reason I don't get notifications reliably on GitHub.
I indeed had the models locally. I am not entirely sure if I am able to share the models as the data was licensed to me with an academic license. I also believe that the VUB actually owns the actual models as I made them during my bachelor thesis (this particular point I find quite weird but it is what I was told). You can however easily create your own models! You can use the code in Import-CGN https://github.com/wilrop/Import-CGN to preprocess the Corpus Gesproken Nederlands and use these audio files to train models on the latest deepspeech version. I have had the best performance by using a pre-trained English model and finetuning that on my Dutch audio files. Good luck!
Willem
Edit: I just saw I didn't answer the last part of your question. So far, there has not been any follow-up research. However, that might change in the future depending on the specific circumstances.
Do you think that you could just ask for permission from VUB to submit your trained models to? https://github.com/coqui-ai/STT-models
There is only one other Dutch model and its tiny. Most of us, actually dont have the skills to build our own language models. I think there is no issue with releasing speech models trained on licensed data, as long as you didn't already agree not to.
Unfortunately, I really cannot do that. In addition, the performance of my language model will most likely be genuinely horrible compared to what modern models are able to do.
Hi Willem,
I just stumbled upon the Demo of DuStt. Very interesting! But since I couldn't find the trained models (and suspect you had them locally as per:) https://github.com/wilrop/DuStt-Demo/blob/a0956acbfe70c51d160a4b25b649d2b3ab30879c/stt4dutch/inference/functions.py#L5 I was wondering whether you still have them somewhere and if so if I could try them out some time?
If licenses permit it of course, I think they would also be welcomed in this DeepSpeech topic: https://discourse.mozilla.org/t/links-to-pretrained-models/62688
Finally I was also wondering whether there has been any follow up research of this project?
Looking forward to reading from you! kr, Dieter