Open zeynabyousefi opened 2 weeks ago
Hi, @zeynabyousefi! We used MFA v1.0. As for the English model, its meta.yaml file states version v0.9.0, architecture gmm+hmm, and features mfcc+deltas.
Thanks. I am training the Encoder Diff VC model using the LJSpeech dataset. Currently, I am facing some issues with data preprocessing and setting up input parameters. I would appreciate any guidance on the appropriate configuration for both.
Additionally, I've encountered errors while running the get_avg_mels.ipynb file, which seem to be due to mismatches in sample rates, audio features (such as MFCC or Mel spectrogram), or other processing parameters.
If specific settings are required for data preprocessing and input parameters, please provide detailed instructions.
Thank you in advance for your assistance!
@ytyeung @wenyong-h @ivanvovk @huawei-noah-admin
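One way to narrow down a suspected sample-rate mismatch before running the notebook is to scan the WAV headers of the dataset. Below is a minimal sketch using only the Python standard library. The function name `find_sample_rate_mismatches` and the default of 22050 Hz are assumptions made for illustration (LJSpeech is distributed at 22050 Hz); the rate the project actually expects should be taken from its own configuration.

```python
import wave
from pathlib import Path


def find_sample_rate_mismatches(wav_dir, expected_rate=22050):
    """Return {filename: rate} for WAV files whose header rate differs from expected_rate.

    expected_rate is an assumed default, not a value confirmed by the project.
    """
    mismatches = {}
    for path in sorted(Path(wav_dir).glob("*.wav")):
        # Only the header is read, so this is fast even on large datasets.
        with wave.open(str(path), "rb") as wav_file:
            rate = wav_file.getframerate()
        if rate != expected_rate:
            mismatches[path.name] = rate
    return mismatches
```

Calling `find_sample_rate_mismatches("path/to/wavs")` on the dataset's audio folder (the path here is a placeholder) returns an empty dictionary when every file matches the expected rate. Any files it lists would need resampling before mel features are computed.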
Hello, I would like to know the exact version of the Montreal Forced Aligner (MFA) used in this project. I need to confirm the version to ensure compatibility with other project components.
@ytyeung @wenyong-h @ivanvovk @huawei-noah-admin