Closed MalikMahnoor closed 4 years ago
While we plan to target other languages, we haven't made any decision as to which is the next language to target yet. If you've sufficient speech data for Urdu, thousands of hours of speech, we'd be willing to help in modifying our code for Urdu and lending some server resources for training.
Actually we are trying to make changes in spell.py and text.py for urdu language, and also working for language model in urdu.We have a corpus of urdu on which we will be doing our training.Is this the right approach ?
@MalikMahnoor Sounds about right. (I'd have to see the details to be sure.) How large an Urdu corpus do you have?
700 sentences along with their audios ..but we are using this just to make a prototype..we can even collect more dataset..if this corpus shows good results
Sent from my T-Mobile 4G LTE Device
And by the way your spell.py and text.py is working fine for urdu as well.We have made our language model ,changed the dataset to urdu..The code works fine till the creation of execution context..It gives error on training.The errors to our understanding are because of n_characters (which we have changed too to no of characters in urdu)but there are other errors too.
Sent from my T-Mobile 4G LTE Device
Could you post the errors you're getting? Maybe we can help.
We have managed to fix those errors..now it goes in to training..the code works fine now.. but only for isolated words not sentences .We are trying to fix text.py for that.Hopefully we ll be able to do that within a few days
Sent from Yahoo Mail on Android
On Thu, Jul 13, 2017 at 8:27 PM, Kelly Davisnotifications@github.com wrote:
Could you post the errors you're getting? Maybe we can help.
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.
Awesome!
Thanks !
Sent from Yahoo Mail on Android
On Thu, Jul 13, 2017 at 8:35 PM, Kelly Davisnotifications@github.com wrote:
Awesome!
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.
@MalikMahnoor When you get an Urdu model up and running and want to distribute it to the world, we'd be happy to help host the model for you. Providing, say, S3 storage so others can download the model.
Hi @MalikMahnoor I am also working on Urdu Speech Recognition but using a different approach. I have already tried single speaker 700 sentences corpus recorded by Agha Ali. It is not useful corpus and now planning to add data from new sources. We can collaborate. thanks
@abbasrazaali @MalikMahnoor I would suggest you take a look also at Common Voice, they are working on localization and internationalization, that would help you augment the corpus.
@kdavis-mozilla Are there any specific requirements for audio recordings you need? What if, we provide you, thousands hours of recordings of Urdu TV/radio. Please specify, if there are any such requirements. Can you please also explain, what type of code changes are needed for accomplishing Urdu support?
@sajjadsaleem I don't know if there are hard an fast requirements. However, there are some things which we have found to work.
As for supporting Urdu you'll need to make changes similar to those required for French support which is described here[1] or German described here[2].
hi its me sehar gul deep speech is new for me i have to train it for urdu language can u help me how to train it for urdu language??
@sehargul A good start is the discourse post[1]; further discussion can be had there.
Any updates on the Urdu model?
Hi. I couldn't find spell.py file in DeepSpeech Master - Version 0.2.0 alpha 0. what could be the substitute of it ? Thank you!
@kdavis-mozilla, can you please answer my query? What could be substitute of spell.py file in Deepspeech master Version 0.2.0 alpha0. Thank you!
@Hafsa26 There have been a lot of changes since spell.py was in the repo. Could you say a little more about what you want to do?
@Hafsa26 There have been a lot of changes since spell.py was in the repo. Could you say a little more about what you want to do?
I am working on Urdu Language Speech Recognition system using DeepSpeech. As you said above, we need to make changes in text.py and spell.py for it. I found text.py in repo but couldn't find spell.py. So what could be the solution for it? Secondly, if you have any blog or help for speech recognition system of some other language using Deepspeech. Kindly please share. Thank you!
@Hafsa26 I guess I'm looking more towards: What your goal? spell.py
is no longer in the repo, but the functionality it provided is. So, I need to know what functionality you are trying to use so I can point you in the right direction.
@MalikMahnoor Dear what is the status of your work on Urdu language model ? can you share ?
@kdavis-mozilla I want to create my own language model based on Urdu language. Can you please help me in this matter ? I've collected approximately 9000 audio recorded files in Urdu voice of 100 different sentences. Currently i am training this data with Roman transcription but i want to train it with Urdu transcription.
@kdavis-mozilla I want to create my own language model based on Urdu language. Can you please help me in this matter ? I've collected approximately 9000 audio recorded files in Urdu voice of 100 different sentences. Currently i am training this data with Roman transcription but i want to train it with Urdu transcription.
What's wrong in the current documentation ? There should be everything documented for you to achieve that.
@lissyx can you please elaborate which documentation you are talking about? or share that documentation here. As I've never found any for languages other than English
What about README.md
? I really don't understand what's blocking you.
@lissyx - the README.md
has all the info needed, but I will admit it's hard to pick it out for newcomers... maybe it's time to write a blogpost for "how to train DeepSpeech on a new language"?
@lissyx - the
README.md
has all the info needed, but I will admit it's hard to pick it out for newcomers... maybe it's time to write a blogpost for "how to train DeepSpeech on a new language"?
Maybe, but again, if we don't know the pain points, it's less efficient. If you ask me, it's trivial and all properly documented. Obviously it's not the case, and thus I'm unsure I can produce anything more useful than the existent documentation.
I've been running into all the pain points getting DS to work with all the CV langs, so I definite could write up that post... I'm just concerned about how much time it would take - a week or so I'd guess.
When I finish the Windows parts I'll start working on it for Spanish, @JRMeyer I can share with you the "hardest parts" if you want.
@lissyx - the
README.md
has all the info needed, but I will admit it's hard to pick it out for newcomers... maybe it's time to write a blogpost for "how to train DeepSpeech on a new language"?
it would be very helpful indeed.
What about
README.md
? I really don't understand what's blocking you.
I just need to know does DeepSpeech supports RTL transcription like Arabic and Urdu ?
@waqasr6 I know developers outside of Mozilla have used it for Urdu, but we at Mozilla have never used it for such.
I just need to know does DeepSpeech supports RTL transcription like Arabic and Urdu ?
What kind of constraints do you have in mind ? We have support for UTF-8 so chars should be handled properly, and then RTL should not be a problem since this is how training will be done
@lissyx Thanks. Many things in my mind are cleared now. I'll try it with Urdu language model now.
@lissyx Hi, How to convert output_graph.pb model into .pbmm model ? I got my Urdu language model with .pb extension. Is there any way to convert into .pbmm ?
Thank you!
@Hafsa26 Have you read README.md
?
I did. to check the model, I need output_graph.pbmm but I got output_graph.pb Do I need to make some changes to get .pbmm graph rather than .pb graph.
I think what lissyx is referring to is this.
Thank you so much!
Do you mind sharing figures on how well your model performs? You also might want to export it to tflite format for Android support.
@lissyx yes, I would surely share soon. Up till now, I worked on 1 hour of data and the system is working fine. Though, I am getting 100% WER yet but I will tweak the model once I started working on 300 hours data. I initially have to prepare demo of DeepSpeech for Urdu Language.
If there is anything you can share to make it better, I would love to know.
I am not planning to use it on Android yet but I need, I will surely do it. Thank you for helping all the way.
Please avoid images
When I trained model for one hour, loss is gradually decreasing but after 14 epochs, its increasing for some epochs and decreasing for some epochs. What do you suggest in such scenario?
When I trained model for one hour, loss is gradually decreasing but after 14 epochs, its increasing for some epochs and decreasing for some epochs. What do you suggest in such scenario?
Not surprising with only one hour, nothing to conclude. You will have to adjust hyper-parameters, eventually, anyway.
I will. I will be using 300 hours of data next then I will be adjusting hyper-parameters accordingly. Is there any guide for adjusting hyper-parameters?
I wanted to use this model for urdu language .But I found this in FAQ '' DeepSpeech's requirements for the data is that the transcripts match the [a-z ]+ regex, and that the audio is stored WAV (PCM) files. ''
How can I design a neural network for speech transcription for languages like urdu ?