Please check whether this paper is about 'Voice Conversion' or not.
article info.
title: The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for
Improved Dysarthric Speech Recognition
summary: In this paper, we investigate several existing and a new state-of-the-art
generative adversarial network-based (GAN) voice conversion method for
enhancing dysarthric speech for improved dysarthric speech recognition. We
compare key components of existing methods as part of a rigorous ablation study
to find the most effective solution to improve dysarthric speech recognition.
We find that straightforward signal processing methods such as stationary noise
removal and vocoder-based time stretching lead to dysarthric speech recognition
results comparable to those obtained when using state-of-the-art GAN-based
voice conversion methods as measured using a phoneme recognition task.
Additionally, our proposed solution of a combination of MaskCycleGAN-VC and
time stretched enhancement is able to improve the phoneme recognition results
for certain dysarthric speakers compared to our time stretched baseline.
Thunk you very much for contribution!
Your judgement is refrected in arXivSearches.json, and is going to be used for VCLab's activity.
Thunk you so much.
Please check whether this paper is about 'Voice Conversion' or not.
article info.
title: The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition
summary: In this paper, we investigate several existing and a new state-of-the-art generative adversarial network-based (GAN) voice conversion method for enhancing dysarthric speech for improved dysarthric speech recognition. We compare key components of existing methods as part of a rigorous ablation study to find the most effective solution to improve dysarthric speech recognition. We find that straightforward signal processing methods such as stationary noise removal and vocoder-based time stretching lead to dysarthric speech recognition results comparable to those obtained when using state-of-the-art GAN-based voice conversion methods as measured using a phoneme recognition task. Additionally, our proposed solution of a combination of MaskCycleGAN-VC and time stretched enhancement is able to improve the phoneme recognition results for certain dysarthric speakers compared to our time stretched baseline.
id: http://arxiv.org/abs/2201.04908v1
judge
Write [vclab::confirmed] or [vclab::excluded] in comment.