Xflick / EEND_PyTorch

A PyTorch implementation of End-to-End Neural Diarization
MIT License
98 stars 15 forks source link

Question on single audio file inference. #1

Closed tumbleintoyourheart closed 4 years ago

tumbleintoyourheart commented 4 years ago

Hi,

First of all I want to give a thank to you for open-sourcing this work,

Secondly, does it support do inference on a single audio file?

Xflick commented 4 years ago

It does support a single .wav file inference. Though in the inference phase you need to specify a data_dir with a wav.scp file in that directory, your wav.scp file may contain only one audio file.

tumbleintoyourheart commented 4 years ago

Does it mean one can comments all the training part but the inferring+scoring then use the pretrained models with prepared wav.scp file containing the (list of?) test files?

Xflick commented 4 years ago

Yes. Or you may alternatively set stage=5 in run.sh file, which only does inferring and scoring. But I strongly recommend you split a little part of your test data to do adaptation, which may greatly boost the system performance.

tumbleintoyourheart commented 4 years ago

Thanks, I really appreciate such quick and informative responses.