Please check whether this paper is about 'Voice Conversion' or not.
article info.
title: The exploitation of Multiple Feature Extraction Techniques for Speaker
Identification in Emotional States under Disguised Voices
summary: Due to improvements in artificial intelligence, speaker identification (SI)
technologies have brought a great direction and are now widely used in a
variety of sectors. One of the most important components of SI is feature
extraction, which has a substantial impact on the SI process and performance.
As a result, numerous feature extraction strategies are thoroughly
investigated, contrasted, and analyzed. This article exploits five distinct
feature extraction methods for speaker identification in disguised voices under
emotional environments. To evaluate this work significantly, three effects are
used: high-pitched, low-pitched, and Electronic Voice Conversion (EVC).
Experimental results reported that the concatenated Mel-Frequency Cepstral
Coefficients (MFCCs), MFCCs-delta, and MFCCs-delta-delta is the best feature
extraction method.
Thunk you very much for contribution!
Your judgement is refrected in arXivSearches.json, and is going to be used for VCLab's activity.
Thunk you so much.
Please check whether this paper is about 'Voice Conversion' or not.
article info.
title: The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices
summary: Due to improvements in artificial intelligence, speaker identification (SI) technologies have brought a great direction and are now widely used in a variety of sectors. One of the most important components of SI is feature extraction, which has a substantial impact on the SI process and performance. As a result, numerous feature extraction strategies are thoroughly investigated, contrasted, and analyzed. This article exploits five distinct feature extraction methods for speaker identification in disguised voices under emotional environments. To evaluate this work significantly, three effects are used: high-pitched, low-pitched, and Electronic Voice Conversion (EVC). Experimental results reported that the concatenated Mel-Frequency Cepstral Coefficients (MFCCs), MFCCs-delta, and MFCCs-delta-delta is the best feature extraction method.
id: http://arxiv.org/abs/2112.07940v1
judge
Write [vclab::confirmed] or [vclab::excluded] in comment.