bytedance / neurst

Neural end-to-end Speech Translation Toolkit
Other
298 stars 45 forks source link

MT training error #60

Open HildaNya opened 2 years ago

HildaNya commented 2 years ago

I'm training the MT in must-c example by modifying the given examples to the MT equivalence, namely

python3 -m neurst.cli.run_exp \
    --config_paths /mt/zh/mt_training_args.yml,/mt/zh/mt_validation_args.yml \
    --hparams_set speech_transformer_s \
    --model_dir /mt/mt_benchmark

However, I've encountered

    fake_src = numpy.random.rand(1, 4, src_meta["audio_feature_dim"], src_meta["audio_feature_channels"])
KeyError: 'audio_feature_dim'

This is only happening to my MT training. The ASR is running smoothly. Also, I double-checked and I'm pretty sure all required packages are installed and match the dependencies.

Finally, I'm curious if separately training the ASR and MT is equivalent to the "Cascaded" model, or do I have to use cascade_st tool to achieve a cascaded setup?

Thanks.