Please check whether this paper is about 'Voice Conversion' or not.
article info.
title: Replay Spoofing Countermeasure Using Autoencoder and Siamese Network on
ASVspoof 2019 Challenge
summary: Automatic Speaker Verification (ASV) is the process of identifying a person
based on the voice presented to a system. Different synthetic approaches allow
spoofing to deceive ASV systems (ASVs), whether using techniques to imitate a
voice or recunstruct the features. Attackers try to beat up the ASVs using four
general techniques; impersonation, speech synthesis, voice conversion, and
replay. The last technique is considered as a common and high potential tool
for spoofing purposes since replay attacks are more accessible and require no
technical knowledge from adversaries. In this study, we introduce a novel
replay spoofing countermeasure for ASVs. Accordingly, we used the Constant Q
Cepstral Coefficient (CQCC) features fed into an autoencoder to attain more
informative features and to consider the noise information of spoofed
utterances for discrimination purpose. Finally, different configurations of the
Siamese network were used for the first time in this context for
classification. The experiments performed on ASVspoof challenge 2019 dataset
using Equal Error Rate (EER) and Tandem Detection Cost Function (t-DCF) as
evaluation metrics show that the proposed system improved the results over the
baseline by 10.73% and 0.2344 in terms of EER and t-DCF, respectively.
Thunk you very much for contribution!
Your judgement is refrected in arXivSearches.json, and is going to be used for VCLab's activity.
Thunk you so much.
Please check whether this paper is about 'Voice Conversion' or not.
article info.
title: Replay Spoofing Countermeasure Using Autoencoder and Siamese Network on ASVspoof 2019 Challenge
summary: Automatic Speaker Verification (ASV) is the process of identifying a person based on the voice presented to a system. Different synthetic approaches allow spoofing to deceive ASV systems (ASVs), whether using techniques to imitate a voice or recunstruct the features. Attackers try to beat up the ASVs using four general techniques; impersonation, speech synthesis, voice conversion, and replay. The last technique is considered as a common and high potential tool for spoofing purposes since replay attacks are more accessible and require no technical knowledge from adversaries. In this study, we introduce a novel replay spoofing countermeasure for ASVs. Accordingly, we used the Constant Q Cepstral Coefficient (CQCC) features fed into an autoencoder to attain more informative features and to consider the noise information of spoofed utterances for discrimination purpose. Finally, different configurations of the Siamese network were used for the first time in this context for classification. The experiments performed on ASVspoof challenge 2019 dataset using Equal Error Rate (EER) and Tandem Detection Cost Function (t-DCF) as evaluation metrics show that the proposed system improved the results over the baseline by 10.73% and 0.2344 in terms of EER and t-DCF, respectively.
id: http://arxiv.org/abs/1910.13345v1
judge
Write 'confirmed' or 'excluded' in [] as comment.