Open cqjjjzr opened 5 years ago
Excellent! Thank you for your interest and contributions!
It has been a long time since I implemented it, so I can't remember the details exactly. However, its purpose is to implement equation (7) in [1]. It will also help to refer to Fig. 2 in [1].
If I find some spare time, I can analyze the code in detail, but these days I'm too busy. Thank you!
[1] X. Zhang and D. Wang, "Boosting Contextual Information for Deep Neural Network Based Voice Activity Detection," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 2, pp. 252-264, Feb. 2016.
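For readers porting this, here is a minimal sketch of the kind of context expansion that [1] describes, as far as I understand it: each frame is concatenated with neighbouring frames at offsets -W, -W+u, ..., -1-u, -1, 0, 1, 1+u, ..., W-u, W. This is not the repository's bdnn_transform; the function names `neighbor_offsets` and `expand_context`, the mirrored right-hand offsets, and the zero-padding at the edges are all assumptions made for illustration, so please check them against the original script and equation (7) before relying on them.

```python
def neighbor_offsets(w, u):
    """Return frame offsets -w, -w+u, ..., then -1, 0, 1, then the mirror image.

    Hypothetical helper: the right side is built by mirroring the left side
    so that the window stays symmetric even if u does not divide w - 1.
    """
    left = list(range(-w, -u, u))           # e.g. w=19, u=9 -> [-19, -10]
    center = [-1, 0, 1]
    right = [-o for o in reversed(left)]    # e.g. [10, 19]
    return left + center + right


def expand_context(frames, w, u):
    """Concatenate each frame with its neighbours at the offsets above.

    frames: list of equal-length feature vectors (lists of floats).
    Neighbours that fall outside the utterance are replaced by zeros
    (an assumption; the original code may treat edges differently).
    """
    offsets = neighbor_offsets(w, u)
    n_frames = len(frames)
    dim = len(frames[0]) if frames else 0
    zero = [0.0] * dim
    expanded = []
    for n in range(n_frames):
        row = []
        for o in offsets:
            idx = n + o
            row.extend(frames[idx] if 0 <= idx < n_frames else zero)
        expanded.append(row)
    return expanded


if __name__ == "__main__":
    print(neighbor_offsets(19, 9))  # [-19, -10, -1, 0, 1, 10, 19]
    toy = [[float(i)] for i in range(4)]    # four 1-dimensional frames
    for row in expand_context(toy, w=3, u=2):
        print(row)
```

Each output row has `len(offsets) * dim` values, so a C++ port could allocate one contiguous buffer of that size per frame up front instead of building intermediate arrays.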
Hi.
I'm trying to rewrite this project in C++ for better interoperability, user-friendliness, and performance.
I have now successfully implemented MRCG extraction and got a huge quality boost as well as low memory usage. However, I have some trouble understanding the script that does the prediction. It involves a lot of array allocation, and I want to know the purpose of every single line in order to write a better implementation.
So, could you please explain the bdnn_transform function?
Thanks in advance.