This may be something obvious/trivial, but I'm new to the topic and not much of a mathematician.
In your MFCC + DTW.ipynb example the output is 192.489808008.
How can I convert it to a 0-1 range so I can judge it against a similarity threshold?
I will use this to determine whether a recording is an authentic or a deepfaked voice.
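
To make the question concrete, here is a rough sketch of the kind of mapping I have in mind. The function name, the exponential form, the `scale` value of 200 and the 0.5 threshold are all placeholder assumptions of mine, not anything taken from the notebook, and I don't know whether this is a sensible way to do it.

```python
import math

def dtw_distance_to_similarity(distance, scale):
    """Map a non-negative DTW distance to a similarity score in (0, 1].

    A distance of 0 gives 1.0; larger distances decay towards 0.
    `scale` is a hypothetical tuning constant that would have to be
    chosen from known matching / non-matching recordings.
    """
    if distance < 0:
        raise ValueError("distance must be non-negative")
    if scale <= 0:
        raise ValueError("scale must be positive")
    return math.exp(-distance / scale)

# The distance reported by the notebook example.
distance = 192.489808008

# Both numbers below are guesses for illustration only.
similarity = dtw_distance_to_similarity(distance, scale=200.0)
is_match = similarity >= 0.5

print(similarity, is_match)
```

Is something like this reasonable, or should the distance first be normalised (for example by the length of the warping path) before it is mapped to a score?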