Open GeorgeS2019 opened 3 years ago
Thank you @GeorgeS2019 for mentioning. For context, voice100 is my personal TTS/ASR project with CNN layers without recursion for embedding in mobile apps Xamarin Android sample . It is not based on research papers. I think it has poor documentation and I am working on it. Please let me know if you have any idea how to improve.
@kaiidams also provides ONNX model for ASR (Automatic Speech Recognition ) based on QuartzNet of NVidia NeMo
The ONNX has been tested in Godot
Check out the readme.md for performance and accuracy!
Evaluate if the following ONNX address the speech--audio-processing category