Hello, I'm trying to train my own dataset with Japanese speakers.
As I'm a beginner at programming I don't understand your source cords nicely.
What does "if str(x.shape) == '(513, 800, 3)': " mean??
Is this an output size determined by sox operation??
Hello, I'm trying to train my own dataset with Japanese speakers. As I'm a beginner at programming I don't understand your source cords nicely. What does "if str(x.shape) == '(513, 800, 3)': " mean?? Is this an output size determined by sox operation??
Anyway, thank you very much!!