ZhongshuHou / LSA

Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)
MIT License
25 stars 6 forks source link

iSTFT_module_1_8 is a wrong function #5

Open yezhangyinge opened 2 weeks ago

yezhangyinge commented 2 weeks ago

Hello! Thank you for your great work. But I found that the iSTFT_module_1_8 is a wrong function, which does not work well when the frame shift is 8ms and the frame len is 32ms (which means 3/4 overlap). I carefully checked the value of the result of the process:

  1. test_sample= torch.randn((512*100), dtype=torch.float32)*100
  2. do stft
  3. use iSTFT_module_1_8 to get new_sample
  4. print(torch.abs(test_sample-new_sample).mean()) After I do these, I found that the mean value of abs is very big, which means the calcucation of this function is wrong. Did I miss something? Could you give me some advise? My wechat is yezhangyinge, could you add me if you are convenient? Thank you very much!