georgid / AlignmentDuration

Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.
http://mtg.upf.edu/node/3751
GNU Affero General Public License v3.0
56 stars 6 forks source link

reduce dependency on htk and scikit learn #40

Closed georgid closed 7 years ago

georgid commented 7 years ago

make sure extracting MFCC with essentia same as damp model:

dont use scikit learn at all, keep LyricsWIthModelsGMM class for chinese.

georgid commented 7 years ago

only dependency on htkparser solved