wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit
https://wenet-e2e.github.io/wenet/
Apache License 2.0
4.18k stars 1.08k forks source link

WeNet c++ inference #1802

Closed Kevindurant111 closed 10 months ago

Kevindurant111 commented 1 year ago

Hi, guys. Recently, I have been working on developing C++ routines for Wenet. I have completed the development work, and the accuracy is aligned with your Python examples. This C++ post-processing module is based on Slyne's work, and it mainly involves some interface refactoring. This means that you no longer need to recompile third-party libraries like fst and KenLM when using these post-processing interfaces. Additionally, I have provided some utility functions, such as ctc_decoding and file reading functions. By the way, I am also developing a speech pre-processing module based on Armadillo, which is mainly used to extract fbank or mfcc features from WAV files. The reason for this is that Kaldi has too many dependencies, which is not very user-friendly for C++ users. I will gradually improve this module, and it will be open-sourced soon. I hope these are useful to you.

jayakrishnanmm commented 1 year ago

Hi , @Kevindurant111 may you share your code ? I can test .

Kevindurant111 commented 1 year ago

Hi , @Kevindurant111 may you share your code ? I can test .

pls see my homepage(currently only post-processing module are available)

github-actions[bot] commented 11 months ago

This issue has been automatically closed due to inactivity.