Please check whether this paper is about 'Voice Conversion' or not.
article info.
title: Identifying Source Speakers for Voice Conversion based Spoofing Attacks
on Speaker Verification Systems
summary: An automatic speaker verification system aims to verify the speaker identity
of a speech signal. However, a voice conversion system manipulates the original
person's speech signal to make it sound like the target speaker's voice and
deceive the speaker verification system. Most countermeasures for voice
conversion-based spoofing attacks are designed to discriminate bona fide speech
from spoofed speech for speaker verification systems. In this paper, we
investigate the problem of source speaker identification -- inferring the
identity of the source speaker given the voice converted speech. To perform
source speaker identification, we simply add voice-converted speech data with
the label of source speaker identity to the genuine speech dataset during
speaker embedding network training. Experimental results show the feasibility
of source speaker identification when training and testing with converted
speeches from the same voice conversion model(s). When testing on converted
speeches from an unseen voice conversion algorithm, the performance of source
speaker identification improves when more voice conversion models are used
during training.
Thunk you very much for contribution!
Your judgement is refrected in arXivSearches.json, and is going to be used for VCLab's activity.
Thunk you so much.
Please check whether this paper is about 'Voice Conversion' or not.
article info.
title: Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems
summary: An automatic speaker verification system aims to verify the speaker identity of a speech signal. However, a voice conversion system manipulates the original person's speech signal to make it sound like the target speaker's voice and deceive the speaker verification system. Most countermeasures for voice conversion-based spoofing attacks are designed to discriminate bona fide speech from spoofed speech for speaker verification systems. In this paper, we investigate the problem of source speaker identification -- inferring the identity of the source speaker given the voice converted speech. To perform source speaker identification, we simply add voice-converted speech data with the label of source speaker identity to the genuine speech dataset during speaker embedding network training. Experimental results show the feasibility of source speaker identification when training and testing with converted speeches from the same voice conversion model(s). When testing on converted speeches from an unseen voice conversion algorithm, the performance of source speaker identification improves when more voice conversion models are used during training.
id: http://arxiv.org/abs/2206.09103v1
judge
Write [vclab::confirmed] or [vclab::excluded] in comment.