MainRo / deep-speaker

An implementation of Deep Speaker from Baidu
MIT License
6 stars 3 forks

Select MFCC library with Python bindings #1

Closed MainRo closed 5 years ago

MainRo commented 6 years ago

An MFCC library is needed to extract features. It should be implemented in Rust, with Python bindings via cbindgen and cffi. Some crates are available:

We must test them to see whether they produce the same results as the default TensorFlow implementation, and whether they allow for more configuration.
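A rough sketch of how that comparison could be checked, assuming both implementations return MFCCs as a list of frames, each a list of floats. The names `max_abs_diff` and `mfcc_match` and the default tolerance are hypothetical and only here for illustration; a suitable tolerance depends on the float precision and normalization of the libraries being compared.

```python
def max_abs_diff(reference, candidate):
    """Largest absolute element-wise difference between two MFCC matrices.

    Both arguments are lists of frames; each frame is a list of floats.
    """
    if len(reference) != len(candidate):
        raise ValueError("frame counts differ")
    worst = 0.0
    for ref_frame, cand_frame in zip(reference, candidate):
        if len(ref_frame) != len(cand_frame):
            raise ValueError("coefficient counts differ")
        for r, c in zip(ref_frame, cand_frame):
            worst = max(worst, abs(r - c))
    return worst


def mfcc_match(reference, candidate, tolerance=1e-4):
    """True if every coefficient agrees within `tolerance` (an assumed default)."""
    return max_abs_diff(reference, candidate) <= tolerance
```

Running each candidate crate and the TensorFlow reference on the same audio with the same frame length, stride, and filter count, then passing both outputs to `mfcc_match`, would give a simple pass/fail per crate.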

Otherwise, a new library can be written, preferably on top of an existing DCT library. One is available here:
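To illustrate what building on a DCT would involve: the last stage of MFCC extraction takes the log of the mel filterbank energies and applies a DCT-II. Below is a minimal pure-Python sketch of that stage only. The function names, the `1e-10` log floor, and the default of 13 coefficients are assumptions for illustration, and the DCT is left unnormalized. Scaling conventions differ between implementations, so matching another library's output may require a different normalization.

```python
import math


def dct_ii(values):
    """Unnormalized DCT-II: X[k] = sum_n x[n] * cos(pi / N * (n + 0.5) * k)."""
    n_values = len(values)
    return [
        sum(
            x * math.cos(math.pi / n_values * (n + 0.5) * k)
            for n, x in enumerate(values)
        )
        for k in range(n_values)
    ]


def cepstral_coefficients(filterbank_energies, num_ceps=13):
    """Log-compress mel filterbank energies and keep the first DCT-II outputs."""
    floor = 1e-10  # assumed floor so that zero energies do not hit log(0)
    log_energies = [math.log(max(e, floor)) for e in filterbank_energies]
    return dct_ii(log_energies)[:num_ceps]
```

A real library would delegate `dct_ii` to the existing DCT crate and expose the coefficient count and normalization as configuration.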

If no serious option exists, then FFTW can be used as the basis of a new Rust MFCC library:
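For context, the part of an MFCC pipeline that an FFT library would cover is the per-frame power spectrum, which is then mapped onto the mel scale. The sketch below is not FFTW: it uses a naive O(N^2) DFT from the Python standard library as a stand-in, purely to show the shape of the computation. The function names are hypothetical, and the mel formula is one common variant (2595 * log10(1 + f / 700)); other implementations may use a different one.

```python
import cmath
import math


def power_spectrum(frame):
    """Power spectrum for bins 0..N/2 using a naive DFT (stand-in for an FFT)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * k * t / n) for t, x in enumerate(frame)
        )
        spectrum.append(abs(coeff) ** 2 / n)
    return spectrum


def hz_to_mel(hz):
    """Convert frequency in Hz to mel (one common variant of the formula)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)
```

The mel conversions are what a filterbank builder would use to place triangular filters at equal mel spacing over the power spectrum bins.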

aouinizied commented 5 years ago

Another option is to use SpeechPy: https://github.com/astorfi/speechpy