juanmc2005 / diart

A python package to build AI-powered real-time audio applications
https://diart.readthedocs.io
MIT License
898 stars 76 forks source link

Add model arguments to `diart.stream`, `diart.benchmark` and `diart.tune` #79

Closed juanmc2005 closed 1 year ago

juanmc2005 commented 1 year ago

Problem

Since pyannote/segmentation and pyannote/embedding seem to be aliases to the newest pyannote.audio releases, the instructions for reproducibility keep getting out of date and diart scripts don't allow you to choose a different checkpoint.

Currently the only option for users to choose a different model (even if it's only loading a different one from huggingface) is writing a custom script nearly identical to the existing ones.

Idea

Add arguments to diart scripts so that users can choose the pyannote models to load. For example, the segmentation model should be pyannote/segmentation@Interspeech2021 when reproducing the paper's results.

Example

diart.stream --segmentation pyannote/segmentation@Interspeech2021 --embedding pyannote/embedding
juanmc2005 commented 1 year ago

Implemented in #77