georgepar / kaldi-grpc-server

Deploy Kaldi models using grpc for bidirectional streaming.
Apache License 2.0

Added diarization component #6

Closed: GeoRouv closed this 1 year ago

GeoRouv commented 2 years ago

Changes

Kaldigrpc:

Scripts:

Test Usage

SAD results

You can find them in: _/kaldigrpc/SADmodel/data/segments

The format of the "segments" file is: \<utterance-id> \<recording-id> \<segment-begin> \<segment-end>
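For quick inspection of the SAD output, a small parser for the four-column "segments" format can help. The following is only a sketch: the `Segment` type and the `parse_segments` helper are hypothetical names that are not part of this PR, and it assumes whitespace-separated fields with begin/end times given in seconds.

```python
from typing import List, NamedTuple


class Segment(NamedTuple):
    """One line of a "segments" file (hypothetical helper type)."""
    utterance_id: str
    recording_id: str
    begin: float  # assumed to be seconds from the start of the recording
    end: float    # assumed to be seconds from the start of the recording


def parse_segments(text: str) -> List[Segment]:
    """Parse lines of '<utterance-id> <recording-id> <segment-begin> <segment-end>'."""
    segments = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != 4:
            raise ValueError(f"line {line_no}: expected 4 fields, got {len(fields)}")
        utt_id, rec_id, begin, end = fields
        segments.append(Segment(utt_id, rec_id, float(begin), float(end)))
    return segments


def total_speech(segments: List[Segment]) -> float:
    """Sum of segment durations, i.e. the amount of detected speech."""
    return sum(seg.end - seg.begin for seg in segments)
```

To use it, read the segments file into a string (for example with `pathlib.Path(...).read_text()`) and pass it to `parse_segments`; `total_speech` then gives the total detected speech under the same assumptions.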

Diarization results

You can find them in: _/kaldigrpc/diarizermodel/data/rttm

The format of the "rttm" file is: SPEAKER \<recording-id> 1 \<start time> \<duration> NA NA \<speaker-id> NA NA
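To summarize the diarization output, the "rttm" lines can be read with a few lines of Python. This is a minimal sketch based only on the field order described above: `Turn`, `parse_rttm` and `speaking_time` are hypothetical names that are not part of this PR, and start time and duration are assumed to be in seconds.

```python
from collections import defaultdict
from typing import Dict, List, NamedTuple


class Turn(NamedTuple):
    """One SPEAKER line of an "rttm" file (hypothetical helper type)."""
    recording_id: str
    start: float     # assumed to be seconds
    duration: float  # assumed to be seconds
    speaker_id: str


def parse_rttm(text: str) -> List[Turn]:
    """Parse 'SPEAKER <recording-id> 1 <start> <duration> NA NA <speaker-id> NA NA' lines."""
    turns = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        fields = line.split()
        if not fields or fields[0] != "SPEAKER":
            continue  # skip blank lines and any non-SPEAKER records
        if len(fields) < 8:
            raise ValueError(f"line {line_no}: expected at least 8 fields, got {len(fields)}")
        # Field positions: 1 = recording id, 3 = start, 4 = duration, 7 = speaker id
        turns.append(Turn(fields[1], float(fields[3]), float(fields[4]), fields[7]))
    return turns


def speaking_time(turns: List[Turn]) -> Dict[str, float]:
    """Total speaking time per speaker id."""
    totals: Dict[str, float] = defaultdict(float)
    for turn in turns:
        totals[turn.speaker_id] += turn.duration
    return dict(totals)
```

Passing the contents of the rttm file to `parse_rttm` and then to `speaking_time` gives a per-speaker total, which is a convenient sanity check on the number of speakers found.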

Notes

If you only want to perform diarization, there is no need to run SAD separately, as it is already executed in the initial stage of the diarization process. If you want to inspect the SAD results produced during diarization, they can be found in: _/kaldigrpc/diarizermodel/data/segments

TODO