Global conditioning on speaker identification

basveeling / wavenet

Keras WaveNet implementation

https://soundcloud.com/basveeling/wavenet-sample

1.06k stars 218 forks source link

Global conditioning on speaker identification #13

Open basveeling opened 8 years ago

basveeling commented 8 years ago

And perhaps using a keras embedding layer to learn a representation for speakers?

malzzz commented 7 years ago

I'm actually very interested in this as well. Just to clarify, are you referring to something like FaceNet for voice? Have you done any more research into this area as of late?

basveeling commented 7 years ago

I haven't, but this would be interesting! I don't have any access to NN training hardware right now, but I'd love to see if this works.

faroit commented 7 years ago

would this paper be relevant here?

Also there are a few deep embedded clustering implementations around. Also one in Keras: https://github.com/fferroni/DEC-Keras but I don't know if this one is well tested