Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Problem Overview
I download the audios from the "FACodec: Voice Conversion Samples" and I use the python script shown in FACodec-README with pretrained model "FACodecEncoderV2/FACodecDecoderV2", but the voice conversion is not as good as the demo showcase or the results in https://huggingface.co/spaces/amphion/naturalspeech3_facodec
audio files is here: audio files "1_female_recon.wav" is the voice conversion audio by myself, "1_female_recon_huggingface.wav" is from https://huggingface.co/spaces/amphion/naturalspeech3_facodec