dictation-toolbox / dragonfly

Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
GNU Lesser General Public License v3.0

Rules using command chaining and Dictation don't work for the Sphinx engine #4

Closed drmfinlay closed 6 years ago

drmfinlay commented 6 years ago

Command chaining (repetition) of rules using Dictation elements/extras doesn't work at the moment. This is because such rules must be spoken as separate utterances, with pauses between them.

I have some work and tests on this using a Timer thread that processes the repetitions of rules collected so far once a timeout period elapses, similar to the timeout that starts after speaking part of a rule. This work is in the feature/repetition branch.
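The timer-based approach can be sketched with Python's standard library alone. This is a hypothetical illustration, not dragonfly's actual API: each recognized repetition is buffered and the timeout restarts, and only when no new speech arrives within the timeout is the whole chained sequence processed together.

```python
# Hypothetical sketch (not dragonfly's real implementation) of buffering
# rule repetitions and processing them together after a timeout.
import threading
import time


class RepetitionBuffer:
    def __init__(self, timeout, on_complete):
        self._timeout = timeout
        self._on_complete = on_complete  # called with the buffered results
        self._results = []
        self._timer = None
        self._lock = threading.Lock()

    def add(self, result):
        """Buffer one recognized repetition and (re)start the timeout."""
        with self._lock:
            self._results.append(result)
            if self._timer is not None:
                self._timer.cancel()
            self._timer = threading.Timer(self._timeout, self._flush)
            self._timer.start()

    def _flush(self):
        # Timeout elapsed with no new speech: process the whole chain.
        with self._lock:
            results, self._results = self._results, []
        self._on_complete(results)


processed = []
buf = RepetitionBuffer(0.1, processed.append)
buf.add("open file")     # a command rule
buf.add("hello world")   # dictated text spoken in a later utterance

time.sleep(0.3)  # wait for the timeout to fire
print(processed)  # -> [['open file', 'hello world']]
```

The key point is that cancelling and restarting the timer on each utterance lets rules spoken across several utterances still be treated as one chained command.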

drmfinlay commented 6 years ago

I've been doing a little more investigation into this lately, and fixing it completely is extremely difficult, in my opinion.

The best way to support this functionality is probably to use n-gram language models interspersed with grammar literals, as Silvius does. I believe that CMU Pocket Sphinx and Kaldi (what Silvius uses) both support the same ARPA n-gram language model format. Having that would get rid of utterance breaks altogether and simplify the engine considerably. I'm not sure where to start with implementing that, though.
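For reference, the ARPA format both engines accept is a plain-text file of log10 probabilities, words, and optional backoff weights. A toy fragment (with made-up probabilities, purely to show the layout) looks like:

```
\data\
ngram 1=4
ngram 2=2

\1-grams:
-0.60206 </s>
-0.60206 <s>	-0.30103
-0.60206 open	-0.30103
-0.90309 file	-0.30103

\2-grams:
-0.30103 <s> open
-0.30103 open file

\end\
```

Each n-gram line is the log probability, the word sequence, and (for non-final orders) a backoff weight, which is what would let free dictation mix with grammar literals in a single decoding pass.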

For the moment, I think it would be best to generate separate JSGF rules for mappings that use Dictation by changing dragonfly2jsgf. Repeatable mappings without Dictation could then still be repeated without the current performance issues; you would just need to pause before starting a mapping that uses Dictation.
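To illustrate the split, here is a rough pseudo-JSGF sketch (the `<dictation>` rule is a placeholder for however the engine handles free dictation, not standard JSGF):

```
#JSGF V1.0;
grammar example;

// Repeatable commands without Dictation: chainable in one utterance.
public <commands> = (save file | next tab | go up)+;

// Mapping using Dictation kept as a separate rule, spoken on its own
// after a pause.
public <say_text> = say <dictation>;
```

Keeping `<say_text>` out of the repeatable `<commands>` rule is what avoids the performance problems while still allowing dictation commands individually.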

I've deleted the feature/repetition branch as it was very out of date and the approach is a dead end.