Open Rahul-Brito opened 2 years ago
To do any of the remaining tech items, I need to
[x] Figure out how to remove the lab tech from the recording using the pyannote APIs. I could just cut out one minute, but that seems unreliable, especially if she sometimes talks at the end, and I don't want to lose too much data either. Once I do this I can try the above things
[x] Chunk out GeMAPS based on diarized audio
[x] Run the GeMAPS data through the same dimensionality-reduction pipeline
[ ] CCA - look at whether, and how much, F0 is represented in the embeddings
[ ] Look at distances WITHIN participants - talk to Gasser
[x] On Hausdorff distances - compare each cell (pairwise) instead of distributions
[x] Sweep different parameters for KNN, etc. (e.g. instead of 10 neighbors, sweep up to 50)
[x] Need to do a better job contextualizing the problem and space
[ ] Look at silhouette coefficient or Rand index (?)
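
For the pairwise Hausdorff item above, here is a minimal sketch of one possible reading: compute the symmetric Hausdorff distance between every pair of participants' embedding point sets. The participant IDs, the toy 2-D points, and the helper names (`directed_hausdorff`, `hausdorff`, `pairwise_hausdorff`) are all hypothetical and only for illustration; this is not the code used in the project.

```python
import math


def directed_hausdorff(a, b):
    """Largest distance from any point in `a` to its nearest point in `b`."""
    return max(min(math.dist(p, q) for q in b) for p in a)


def hausdorff(a, b):
    """Symmetric Hausdorff distance: the larger of the two directed distances."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))


def pairwise_hausdorff(sets):
    """Hausdorff distance for every unordered pair of named point sets."""
    names = list(sets)
    return {
        (m, n): hausdorff(sets[m], sets[n])
        for i, m in enumerate(names)
        for n in names[i + 1:]
    }


# Hypothetical toy data: one small set of 2-D embedding points per participant.
embeddings = {
    "p01": [(0.0, 0.0), (1.0, 0.0)],
    "p02": [(0.0, 1.0), (1.0, 1.0)],
    "p03": [(3.0, 0.0), (4.0, 0.0)],
}

distances = pairwise_hausdorff(embeddings)
for pair, d in distances.items():
    print(pair, round(d, 3))
```

This brute-force version is quadratic in the number of points per pair, which is fine for a sanity check but may be slow on full embeddings.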
RA feature requests
Yep, that makes sense! The diarization pipeline we have seems to work quite well, so we should be able to separate the mother's voice from the RA's automatically. I am hoping/assuming we can do this for the baby cries too. I am not sure whether any of our recordings captured extended baby crying that we could test this on, but I can check!
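
As a rough sketch of the separation step, assume the diarization output has already been flattened into `(start, end, speaker)` tuples (for example by iterating a pyannote annotation with `itertracks(yield_label=True)`). The heuristic below, that the mother is whichever speaker talks the most, is an assumption for illustration and would need checking against real recordings; the function names and the toy turns are hypothetical.

```python
from collections import defaultdict


def total_speech_per_speaker(turns):
    """Sum speaking time in seconds for each speaker label."""
    totals = defaultdict(float)
    for start, end, speaker in turns:
        totals[speaker] += end - start
    return dict(totals)


def keep_dominant_speaker(turns):
    """Keep only the segments of the speaker with the most total speech.

    Assumption: the dominant speaker is the mother and the other label
    is the RA. This is a heuristic, not something the pipeline guarantees.
    """
    totals = total_speech_per_speaker(turns)
    if not totals:
        return None, []
    dominant = max(totals, key=totals.get)
    kept = [(start, end) for start, end, speaker in turns if speaker == dominant]
    return dominant, kept


# Hypothetical diarization output: (start_sec, end_sec, speaker_label).
turns = [
    (0.0, 4.0, "SPEAKER_00"),
    (4.5, 30.0, "SPEAKER_01"),
    (31.0, 33.0, "SPEAKER_00"),
    (33.5, 60.0, "SPEAKER_01"),
]

dominant, kept = keep_dominant_speaker(turns)
print(dominant, kept)
```

Filtering by speaker label rather than cutting a fixed minute keeps any RA speech at the end of a recording out of the data without discarding the mother's speech around it.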