cmusphinx / node-pocketsphinx

Pocketsphinx bindings for Node.JS
Other
242 stars 47 forks source link

Arabic model returns only silence #34

Closed ahmedalbakiri closed 7 years ago

ahmedalbakiri commented 7 years ago

Can I used it to recognition any language or just for English ?

nshmyrev commented 7 years ago

You can find models for other supported languages in our downloads area.

ahmedalbakiri commented 7 years ago

So, I have already build my acoustic model and LM and dic, and the sound is in .wav. can this work there?

2016-10-05 10:58 GMT+03:00 Nickolay V. Shmyrev notifications@github.com:

Closed #34 https://github.com/cmusphinx/node-pocketsphinx/issues/34.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/cmusphinx/node-pocketsphinx/issues/34#event-812880727, or mute the thread https://github.com/notifications/unsubscribe-auth/AVlABHv4zv1sa9ZUoqvMrvTIEESJt_18ks5qw1i8gaJpZM4KOM4b .

nshmyrev commented 7 years ago

Yes

ahmedalbakiri commented 7 years ago

Thanks, for your fast response,

I have implemented it and I have got this results,

_exports_Hypothesis { prob: -4802, bestScore: -5894, hypstr: '' } INFO: ngram_search.c(1027): bestpath 0.00 CPU 0.000 xRT INFO: ngram_search.c(1030): bestpath 0.00 wall 0.000 xRT 9 15

16 192 193 380 381 440 # 441 590 why text that recognize is empty? Is it because the text I work on is in arabic or what? thanks in advance. 2016-10-05 14:09 GMT+03:00 Nickolay V. Shmyrev notifications@github.com: > Yes > > — > You are receiving this because you authored the thread. > Reply to this email directly, view it on GitHub > https://github.com/cmusphinx/node-pocketsphinx/issues/34#issuecomment-251646070, > or mute the thread > https://github.com/notifications/unsubscribe-auth/AVlABLMf4tLYS-nEcZ3VuVynB8c3E_FFks5qw4WDgaJpZM4KOM4b > .
nshmyrev commented 7 years ago

Please provide more complete data about your problem - what do you run exactly, what is exactly in the log output, what is your data file, what are your models, etc.

ahmedalbakiri commented 7 years ago

All you need it is in attachment.

2016-10-05 16:41 GMT+03:00 Nickolay V. Shmyrev notifications@github.com:

Please provide more complete data about your problem - what do you run exactly, what is exactly in the log output, what is your data file, what are your models, etc.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/cmusphinx/node-pocketsphinx/issues/34#issuecomment-251677732, or mute the thread https://github.com/notifications/unsubscribe-auth/AVlABDfCen4GGEEDnPJoWIrqedccd0U0ks5qw6kJgaJpZM4KOM4b .

nshmyrev commented 7 years ago

I'm sorry, I do not see any attachments. Maybe you can simply share the file with dropbox/google drive and give here a link.

ahmedalbakiri commented 7 years ago

The below is the link in google drive.

https://drive.google.com/open?id=0BxN0qMy1FI89cS03OWIyemFXUHM

2016-10-05 17:38 GMT+03:00 Nickolay V. Shmyrev notifications@github.com:

I'm sorry, I do not see any attachments. Maybe you can simply share the file with dropbox/google drive and give here a link.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/cmusphinx/node-pocketsphinx/issues/34#issuecomment-251693986, or mute the thread https://github.com/notifications/unsubscribe-auth/AVlABBNpCe7VjJpnZLvM2GRJZaayniomks5qw7ZcgaJpZM4KOM4b .

ahmedalbakiri commented 7 years ago

Mr Nickolay, did you found where is the problem, I try to solve it, but I still get the same output

9 15

16 192 193 380 381 440 441 590 its give all the .wav is sil, where there is sound in that file, what is going on? I have search no solution is found? where is the problem? 2016-10-05 17:59 GMT+03:00 ahmed Parakletos ahmedalbakiri@gmail.com: > The below is the link in google drive. > > https://drive.google.com/open?id=0BxN0qMy1FI89cS03OWIyemFXUHM > > 2016-10-05 17:38 GMT+03:00 Nickolay V. Shmyrev notifications@github.com: > > > I'm sorry, I do not see any attachments. Maybe you can simply share the > > file with dropbox/google drive and give here a link. > > > > — > > You are receiving this because you authored the thread. > > Reply to this email directly, view it on GitHub > > https://github.com/cmusphinx/node-pocketsphinx/issues/34#issuecomment-251693986, > > or mute the thread > > https://github.com/notifications/unsubscribe-auth/AVlABBNpCe7VjJpnZLvM2GRJZaayniomks5qw7ZcgaJpZM4KOM4b > > .
nshmyrev commented 7 years ago

I checked your files. I do not think your model is properly trained. For example, your dictionary doesn't follow tutorial recommendation to use phones or word-specific phones. Your phones are whole syllables which is not a good idea. I also not not know how much training data do you have, what is the error rate during training and so on.

I recommend you to fix model training first and share the model training folder if you still have problems.

ahmedalbakiri commented 7 years ago

I train using context independent (ci), the WER and the results I have in this link,

https://drive.google.com/open?id=0BxN0qMy1FI89UHVmVHd2RlBhZTA

2016-10-09 19:21 GMT+03:00 Nickolay V. Shmyrev notifications@github.com:

I checked your files. I do not think your model is properly trained. For example, your dictionary doesn't follow tutorial recommendation to use phones or word-specific phones. Your phones are whole syllables which is not a good idea. I also not not know how much training data do you have, what is the error rate during training and so on.

I recommend you to fix model training first and share the model training folder if you still have problems.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/cmusphinx/node-pocketsphinx/issues/34#issuecomment-252495991, or mute the thread https://github.com/notifications/unsubscribe-auth/AVlABCE2mXOLguRbm-Sezp2xQb1f7jI1ks5qyRRtgaJpZM4KOM4b .

nshmyrev commented 7 years ago

You need to use proper phoneset first. Then you need to share a whole model training folder.

ahmedalbakiri commented 7 years ago

The whole model is in link, and the results I have is good with these .dic file as you see in results file.

https://drive.google.com/open?id=0BxN0qMy1FI89UHVmVHd2RlBhZTA

2016-10-09 20:30 GMT+03:00 Nickolay V. Shmyrev notifications@github.com:

You need to use proper phoneset first. Then you need to share a whole model training folder.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/cmusphinx/node-pocketsphinx/issues/34#issuecomment-252499961, or mute the thread https://github.com/notifications/unsubscribe-auth/AVlABJHBpzngEWKF1Mqep19MAjQzIKPwks5qySTKgaJpZM4KOM4b .

nshmyrev commented 7 years ago

You do not have required amount of data for training too. As I said, follow the tutorial recommendations in order to get a working system.

ahmedalbakiri commented 7 years ago

do you mean increase the number of sound, to train different speakers?

2016-10-09 20:54 GMT+03:00 Nickolay V. Shmyrev notifications@github.com:

You do not have required amount of data for training too. As I said, follow the tutorial recommendations in order to get a working system.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/cmusphinx/node-pocketsphinx/issues/34#issuecomment-252501310, or mute the thread https://github.com/notifications/unsubscribe-auth/AVlABAGy_Xz_nE5oddrByVBZBPSPXQScks5qySo_gaJpZM4KOM4b .

nshmyrev commented 7 years ago

http://cmusphinx.sourceforge.net/wiki/tutorialam

  • 1 hour of recording for command and control for single speaker
  • 5 hour of recordings of 200 speakers for command and control for many speakers
  • 10 hours of recordings for single speaker dictation
  • 50 hours of recordings of 200 speakers for many speakers dictation
ahmedalbakiri commented 7 years ago

Thanks, I now modify the dict and got a results, but its not good results, it is recognition is bad

_exports_Hypothesis { prob: 0, bestScore: -22760, hypstr: 'لَمْ' } INFO: ngram_search.c(1027): bestpath 0.00 CPU 0.000 xRT INFO: ngram_search.c(1030): bestpath 0.00 wall 0.000 xRT 9 13 لَمْ 14 590 لَمْ

why its in previous previous dic produce a good results in the results folder based WER, but in the pocketsphinx node give a SIL, and now after modification to dic file give results, its not that good, but there is some output?

2016-10-09 21:14 GMT+03:00 Nickolay V. Shmyrev notifications@github.com:

http://cmusphinx.sourceforge.net/wiki/tutorialam

  • 1 hour of recording for command and control for single speaker
  • 5 hour of recordings of 200 speakers for command and control for many speakers
  • 10 hours of recordings for single speaker dictation
  • 50 hours of recordings of 200 speakers for many speakers dictation

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/cmusphinx/node-pocketsphinx/issues/34#issuecomment-252502547, or mute the thread https://github.com/notifications/unsubscribe-auth/AVlABAjdOJerML-c1g8iiANgXUv6DA39ks5qyS77gaJpZM4KOM4b .

nshmyrev commented 7 years ago

why its in previous previous dic produce a good results in the results folder based WER, but in the pocketsphinx node give a SIL, and now after modification to dic file give results, its not that good, but there is some output?

When you do not have enough data results are unstable.