ml-explore / mlx-examples

Examples in the MLX framework
MIT License
6.08k stars, 864 forks

[Feature Request] Support Speaker Diarization #1039

Status: Open. Opened by uniqueness-ae 1 week ago.

uniqueness-ae commented 1 week ago

Please add speaker diarization to the existing MLX Whisper support in order to:

  1. Enhance transcription accuracy in multi-speaker conversations
  2. Distinguish between different speakers in the output
  3. Improve overall usability of the transcription feature

This addition would make transcripts more structured and informative, so that complex multi-speaker audio is easier to analyze and understand. Thanks.
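
To make item 2 in the list above more concrete, here is a rough sketch of one way speaker labels could be attached to transcript segments. It is not an existing MLX or mlx-examples API. It assumes the transcript is a list of dicts with `start`, `end`, and `text` keys (the shape Whisper-style transcribers typically return), and that some separate diarization model has already produced speaker turns as `(start, end, speaker)` tuples. The names `assign_speakers`, `overlap`, and the `SPEAKER_00` style labels are hypothetical and only for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical shape for diarization output: (start_sec, end_sec, speaker_label).
Turn = Tuple[float, float, str]


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Return the length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(
    segments: List[Dict],
    turns: List[Turn],
    unknown: str = "UNKNOWN",
) -> List[Dict]:
    """Label each transcript segment with the speaker whose turns overlap it most.

    Assumption: `segments` are dicts with "start", "end", and "text" keys.
    Segments that overlap no speaker turn get the `unknown` label.
    """
    labeled = []
    for seg in segments:
        # Sum overlap per speaker, since one speaker may have several turns.
        totals: Dict[str, float] = {}
        for t_start, t_end, speaker in turns:
            dur = overlap(seg["start"], seg["end"], t_start, t_end)
            if dur > 0:
                totals[speaker] = totals.get(speaker, 0.0) + dur
        speaker = max(totals, key=totals.get) if totals else unknown
        labeled.append({**seg, "speaker": speaker})
    return labeled


if __name__ == "__main__":
    # Made-up example data, not real model output.
    segments = [
        {"start": 0.0, "end": 2.0, "text": "Hi there."},
        {"start": 2.0, "end": 5.0, "text": "Hello, how are you?"},
    ]
    turns = [(0.0, 2.5, "SPEAKER_00"), (2.5, 6.0, "SPEAKER_01")]
    for seg in assign_speakers(segments, turns):
        print(f'[{seg["speaker"]}] {seg["text"]}')
```

Keeping diarization as a separate post-processing step like this would leave the transcription code untouched, though it cannot split a single segment in which two people speak.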

Hoohm commented 4 days ago

Would love to see this as well. I can help build the feature, but I would need some pointers on how to approach it.