Adds support for Speech Recognition network. Here are some model from open model zoo. You can pick anyone from those model or any other pose estimation model you are prefer
mozilla-deepspeech-0.6.1quartznet-15x5-enwav2vec2-base
The potential work should be:
Add new sub-class as child of base input to parse the new input type for speech.
Adds support for Speech Recognition network. Here are some model from open model zoo. You can pick anyone from those model or any other pose estimation model you are prefer mozilla-deepspeech-0.6.1 quartznet-15x5-en wav2vec2-base
The potential work should be: