Closed kthakore closed 3 years ago
Starting with the microspeech with fft fix. @Michael-F-Bryan might have a better idea here. https://github.com/kthakore/json-eater
!!! If we can test proc blocks in python that is HUGE deal
Need to make an implementation in Rust (copying over a python function) for microspeech. More notes from @meelislootus .
Notes on hotg drive: https://docs.google.com/document/d/1IeJjxcj8VIca_nFGxnNVsbxsuQvGmnI0-Lga0Wy5Tg8/edit#
Overall summary:
Here’s the TF Ops repo with all the parts to the TF spectrogram-computer, in C/C++ implementation: TF spectrogram-computer repo
The steps in the TF spectrogram-computer (they are all sequentially called from frontend.c) are, with links to the relevant code:
I think a reasonable plan to match the model might be (given that especially step 5 might be quite complicated):
It looks like microspeech is good so I'll close this and #113.
Re-train our existing models using data from the phone.