Hello! Just wondered if I use this command to obtain alignment, what's the default behavior? Say, will the speaker adaptation or fine-tuning (fine-tune the pretrained model on the target data) happen?
Just realized that --single_speaker is not related to speaker adaptation at all, while on the other hand the speaker adaptation option --uses_speaker_adaptation false is just fixed recently. BTW, this option is not mentioned in the doc. So I hope to double check.
As of 3.0.0a5, --single_speaker will imply --uses_speaker_adaptation false, since speaker adaptation doesn't add anything in this case. I've updated the docs to reflect this as well.
Hello! Just wondered if I use this command to obtain alignment, what's the default behavior? Say, will the speaker adaptation or fine-tuning (fine-tune the pretrained model on the target data) happen?
mfa align --overwrite --clean --single_speaker ${data} english_mfa english_mfa ${data}/alignment
Just realized that
--single_speaker
is not related to speaker adaptation at all, while on the other hand the speaker adaptation option--uses_speaker_adaptation false
is just fixed recently. BTW, this option is not mentioned in the doc. So I hope to double check.