wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Apache License 2.0
727 stars 121 forks source link

Compression codec augmentation ? #364

Open tcourat opened 1 month ago

tcourat commented 1 month ago

Hi,

Is there a way to randomly add a codec compression as a data augmentation when training speaker embeddings ? Is it already done in current pre-trained models ?

Things like Opus, MP3 etc.. but also telephony like a-law, mu-law etc.

Thanks.

JiJiJiang commented 1 month ago

No, we did not do any codec augmentation in our training data pipeline. It would be very nice if you can contribute the codes with experimental results!