Open WilliamVenner opened 1 week ago
You can either return embeddings from the two lines below https://github.com/k2-fsa/sherpa-onnx/blob/a5295aad10ea932279b415cd573e57273926a69b/sherpa-onnx/csrc/offline-speaker-diarization-pyannote-impl.h#L146-L147
or you can use the diarization results to re-compute the embeddings.
I managed to DIY it. Probably not the best implementation, so I won't PR, but here it is: https://github.com/WilliamVenner/sherpa-onnx/commit/0d533de5451b9ba1f204428b8d154580b707d835#diff-dabb58cf56f7c8b62cb621374dc40f77696e653c14af9bb62ef1790d66d4b174
My task combines both speaker diarization and speaker identification.
Since speaker embeddings are extracted during diarization anyway, it would be fantastic if the user could extract speaker embeddings from the speaker diarization segments/labels as well.
This would allow users to perform speaker identification against an existing speaker diarization result, thereby applying their own identified labels for the speakers and therefore simplify this task's pipeline.