For some reason, the speaker assigned to a given word is not necessarily the same as the speaker assigned to the utterance that word is part of. You can see this in the test data for the utterance "I only need a few months." The utterance-level speaker is "SPEAKER_00" but the word-level speaker for "I" is "SPEAKER_01" (see logs)
The consequences of this are unclear, but documenting the bug in case it becomes a future issue. In some future release it would be nice to have some kind of sanity check on the output to ensure that the subdivision speaker assignments are consistent with their parent assignment.
Contact Details
No response
What happened?
For some reason, the speaker assigned to a given word is not necessarily the same as the speaker assigned to the utterance that word is part of. You can see this in the test data for the utterance "I only need a few months." The utterance-level speaker is "SPEAKER_00" but the word-level speaker for "I" is "SPEAKER_01" (see logs)
The consequences of this are unclear, but documenting the bug in case it becomes a future issue. In some future release it would be nice to have some kind of sanity check on the output to ensure that the subdivision speaker assignments are consistent with their parent assignment.
What operating system are you using?
Ubuntu
Relevant log output
Code of Conduct