Open giappham opened 5 years ago
I ran into this problem as well. I'm assuming these are private tools he developed but hasn't published.
@YongyuG Would it be possible for you to publish these? It looks like only speakin_voice_feats is actually used in the code.
Nevermind. I reimplemented the missing libraries myself. You can find them in my fork.
I couldn't get the conda install to work. Most of the dependencies aren't used and so old conda spends hours trying to resolve conflicts. However, I just created a Python virtualenv and that was enough to run everything.
This library is probably the fastest I've ever evaluated. It correctly parses the example1.wav file in a few seconds, whereas most take several minutes. However, it suffers from the same problem as most unsupervised methods in that it doesn't have very good accuracy with non-toy data and can't be corrected through training. When I run it on a short podcast snippet containing three people with similar sounding voices speaking closely together, it incorrectly places three speakers in the same segment.
Hi @YongyuG . How to download 2 libraries speakin_voice_feats and speakin_ivector in gl_bic_ahc_viterbi.py. I tried installing the environment.yml file but the server died. Many thanks.