Open ra100 opened 10 months ago
Same here. I tried to use the translator object to get the phonetic units but with no success.
text_supersede, unit_output_supersede = translator.predict( example["fbank"], "S2ST", tgt_lang=target_language_code, src_lang=target_language_code, text_generation_opts=text_generation_opts, unit_generation_ngram_filtering=False, duration_factor=1.0, prosody_encoder_input=prosody_encoder_input, src_text='voici un test', # for mintox check )
Is it possible to modify the resulting translated text before it's transformed to audio?
I want to S2ST with expressions, but if the translation doesn't quite match the context or whatever, to "force" the resulting translation with one or two words changed.