This repository contains the dataset and detection code for the paper "AI-Synthesized Voice Detection Using Neural Vocoder Artifacts", accepted at the CVPR Workshop on Media Forensics 2023.
Hello, I would like to ask why the sampling rate is defined as 16000 in the SincConv model, while the sampling rate specified for training is 24000. Will this mismatch affect the training process?
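To make the concern concrete, here is a minimal sketch of the general signal-processing effect behind the question. It is not the repository's SincConv code: the function name `effective_cutoff_hz` and the example band edges are hypothetical, and the only assumption is that a digital filter's cutoff is fixed in cycles per sample, so its physical frequency scales with the sampling rate of the audio it is applied to.

```python
import numpy as np


def effective_cutoff_hz(design_cutoff_hz, design_sr, actual_sr):
    """Physical cutoff of a digital filter that was designed assuming
    design_sr but is applied to audio sampled at actual_sr.

    Hypothetical helper for illustration only.
    """
    # The filter coefficients only encode a normalized frequency
    # (cycles per sample), independent of the real sampling rate.
    normalized = np.asarray(design_cutoff_hz, dtype=float) / design_sr
    # Applied to audio at actual_sr, that normalized frequency
    # corresponds to a different frequency in Hz.
    return normalized * actual_sr


design_sr = 16000   # rate assumed when the filter bank is built
actual_sr = 24000   # rate of the training audio
design_edges = np.array([1000.0, 4000.0, 8000.0])  # example band edges in Hz

effective = effective_cutoff_hz(design_edges, design_sr, actual_sr)
print(effective)  # [ 1500.  6000. 12000.]
```

Under this assumption the band edges are stretched by 24000 / 16000 = 1.5, while the filter bank still spans up to the Nyquist frequency of the actual audio. Whether that shift matters for training in this model is not something the sketch settles.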