I am trying to make a simple api where people can give an input speech and gets the output language identification class as result.
I created a dataset where all the samples are of length: 10 sec. So the model will be trained on 10 sec long samples.
What happens when at test time someone gives an audio smaller or greater then 10 seconds?
I am trying to make a simple api where people can give an input speech and gets the output language identification class as result. I created a dataset where all the samples are of length: 10 sec. So the model will be trained on 10 sec long samples. What happens when at test time someone gives an audio smaller or greater then 10 seconds?