reduce dependency on htk and scikit learn

georgid / AlignmentDuration

Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.

http://mtg.upf.edu/node/3751

GNU Affero General Public License v3.0

56 stars 6 forks source link

reduce dependency on htk and scikit learn #40

Closed georgid closed 7 years ago

georgid commented 7 years ago

make sure extracting MFCC with essentia same as damp model:

add preempahsis (or recreate model without preemphasis )
add cepstral mean normalization

dont use scikit learn at all, keep LyricsWIthModelsGMM class for chinese.

test on Jingju

georgid commented 7 years ago

only dependency on htkparser solved