Open 12-zhx opened 1 year ago
Although the filter length is fixed, filters at other resolutions can easily be obtained by interpolation, which is how variable-length inputs are handled during inference. A more detailed explanation is provided in Sec. II.B of the arXiv version of the paper.
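To make the idea concrete, here is a minimal sketch of resizing a fixed-length filter to match an input of a different length. It is not the repository's code: the function name `resize_filter`, the use of linear interpolation with aligned endpoints, and the real-valued stand-in filter are all assumptions for illustration. The actual implementation may use a framework interpolation routine and a different interpolation mode.

```python
def resize_filter(weights, new_len):
    """Linearly interpolate a 1-D filter to new_len points.

    Hypothetical helper: the first and last taps of the original filter
    are mapped onto the first and last taps of the resized one.
    """
    old_len = len(weights)
    if new_len == 1 or old_len == 1:
        # Degenerate case: nothing to interpolate between.
        return [weights[0]] * new_len
    scale = (old_len - 1) / (new_len - 1)
    out = []
    for i in range(new_len):
        pos = i * scale                   # fractional position in the old filter
        lo = min(int(pos), old_len - 2)   # left neighbour, clamped at the end
        frac = pos - lo
        out.append(weights[lo] * (1 - frac) + weights[lo + 1] * frac)
    return out


# Stand-in for a filter learned with T = 200 (values are made up).
learned = [float(k) for k in range(200)]

# At inference, the filter is resized to whatever length the utterance has.
shorter = resize_filter(learned, 120)
longer = resize_filter(learned, 333)
```

Under these assumptions, `T = 200` only fixes the resolution at which the filter parameters are stored; the filter actually applied at test time has the same length as the input.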
Why is the value of "T" in the global branch set to 200? The length of speech is variable, yet the value is still 200 in the test phase.