rmusser01 / tldw

Too Long, Didn't Watch(TL/DW): Your Personal Research Multi-Tool - Open Source NotebookLM
Apache License 2.0
45 stars 2 forks source link

Improvement: Allow for speaker diarization through non-HF APIs (Offline Diarization) #12

Open rmusser01 opened 1 month ago

rmusser01 commented 1 month ago

A user should be able to request speaker diarization without using a HF token, to allow for self-hosted model usage.

A user may pass an argument to force speaker diarization.

A user may pass an additional argument if speaker diarization is forced, which specifies whether or not the script will use HF or their own self-hosted model.

See https://github.com/pyannote/pyannote-audio/blob/develop/tutorials/community/offline_usage_speaker_diarization.ipynb for reference.

rmusser01 commented 2 weeks ago

Models are in, code needs to be modified to work.