Please check whether this paper is about 'Voice Conversion' or not.
article info.
title: A single speaker is almost all you need for automatic speech recognition
summary: We explore the use of speech synthesis and voice conversion applied to
augment datasets for automatic speech recognition (ASR) systems, in scenarios
with only one speaker available for the target language. Through extensive
experiments, we show that our approach achieves results compared to the
state-of-the-art (SOTA) and requires only one speaker in the target language
during speech synthesis/voice conversion model training. Finally, we show that
it is possible to obtain promising results in the training of an ASR model with
our data augmentation method and only a single real speaker in different target
languages.
Thunk you very much for contribution!
Your judgement is refrected in arXivSearches.json, and is going to be used for VCLab's activity.
Thunk you so much.
Please check whether this paper is about 'Voice Conversion' or not.
article info.
title: A single speaker is almost all you need for automatic speech recognition
summary: We explore the use of speech synthesis and voice conversion applied to augment datasets for automatic speech recognition (ASR) systems, in scenarios with only one speaker available for the target language. Through extensive experiments, we show that our approach achieves results compared to the state-of-the-art (SOTA) and requires only one speaker in the target language during speech synthesis/voice conversion model training. Finally, we show that it is possible to obtain promising results in the training of an ASR model with our data augmentation method and only a single real speaker in different target languages.
id: http://arxiv.org/abs/2204.00618v1
judge
Write [vclab::confirmed] or [vclab::excluded] in comment.