huawei-noah / Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
564 stars, 118 forks

Montreal Forced Alignment (MFA) Version Inquiry #39

Open zeynabyousefi opened 2 weeks ago

zeynabyousefi commented 2 weeks ago

Hello, I would like to know the exact version of Montreal Forced Alignment (MFA) used in this project. I need to confirm the version to ensure compatibility with other project components.

@ytyeung @wenyong-h @ivanvovk @huawei-noah-admin

li1jkdaw commented 1 week ago

Hi @zeynabyousefi! We used MFA v1.0. As for the English model, its meta.yaml file states version 0.9.0, architecture gmm+hmm, and feats mfcc+deltas.
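For anyone who wants to confirm these fields on their own copy of the acoustic model, here is a minimal sketch using only the Python standard library. It assumes the model is a zip archive with a `meta.yaml` somewhere inside it. The function names and the archive path in the usage line are hypothetical, and the parser is deliberately simplistic rather than a full YAML reader.

```python
import zipfile


def read_model_meta(zip_path):
    """Return the text of the first meta.yaml found inside a model archive."""
    with zipfile.ZipFile(zip_path) as zf:
        for name in zf.namelist():
            if name.endswith("meta.yaml"):
                return zf.read(name).decode("utf-8")
    raise FileNotFoundError(f"no meta.yaml in {zip_path}")


def top_level_scalars(meta_text):
    """Collect simple top-level `key: value` pairs.

    Indented lines, list items, and comments are skipped, and values are
    kept as raw strings. This is not a general YAML parser.
    """
    fields = {}
    for line in meta_text.splitlines():
        if not line or line[0] in " \t-#" or ":" not in line:
            continue
        key, _, value = line.partition(":")
        value = value.strip()
        if value:
            fields[key.strip()] = value.strip("'\"")
    return fields
```

Usage would look like `top_level_scalars(read_model_meta("english.zip"))`, where `english.zip` stands in for wherever the model archive actually lives.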

zeynabyousefi commented 6 days ago

Thanks. I am training the Encoder Diff VC model on the LJSpeech dataset, and I am running into issues with data preprocessing and with setting the input parameters. I would appreciate any guidance on the appropriate configuration for both.

Additionally, I've encountered errors while running the get_avg_mels.ipynb notebook. They seem to come from mismatches in sample rate, audio features (such as MFCC or mel spectrogram), or other processing parameters.

If specific settings are required for data preprocessing and input parameters, please provide detailed instructions.

Thank you in advance for your assistance!

@ytyeung @wenyong-h @ivanvovk @huawei-noah-admin
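One way to narrow down the sample-rate suspicion above is to read the rate straight from each WAV header before running the notebook. The following is a minimal sketch under stated assumptions: the function name is hypothetical, it relies on the standard-library `wave` module (which only reads uncompressed PCM WAV files), and the rate the project expects is passed in by the caller rather than taken from the repository.

```python
import wave
from pathlib import Path


def find_sample_rate_mismatches(wav_dir, expected_sr):
    """Map each .wav under wav_dir whose header rate differs from expected_sr to that rate."""
    root = Path(wav_dir)
    mismatches = {}
    for path in sorted(root.rglob("*.wav")):
        # Only the header is inspected; no audio samples are decoded.
        with wave.open(str(path), "rb") as wf:
            sr = wf.getframerate()
        if sr != expected_sr:
            mismatches[str(path.relative_to(root))] = sr
    return mismatches
```

For example, `find_sample_rate_mismatches("LJSpeech-1.1/wavs", 22050)` should return an empty dict for an unmodified LJSpeech download, since that dataset ships at 22050 Hz. Whether 22050 Hz is what get_avg_mels.ipynb expects is an assumption here and should be checked against the parameters defined in the repository.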