jtkim-kaist / Speech-enhancement

Deep neural network based speech enhancement toolkit
GNU General Public License v2.0
210 stars 62 forks source link

LogSpectralDistance calculation #18

Open 1973Blunt opened 5 years ago

1973Blunt commented 5 years ago

Thanks for sharing your code!

This issue is about LogSpectralDistance calculation. Source code is: LSD=mean(sqrt(mean((log(CL(RangeBin,1:N))-log(NO(RangeBin,1:N))).^2)));

It's not consistent with the reference:

J. Du and Q. Huo, “A speech enhancement approach using piecewise linear approximation of an explicit model of environmental distortions,” in Proc. INTERSPEECH, Sep. 2008, pp. 569–572. image

Difference is:

  1. LOG(X) is the natural logarithm of the elements of X.
  2. There's no 10.
jtkim-kaist commented 5 years ago

Thank you for your interest to my project,

I just used already existed code so that I didn't carefully concern about the difference between the implementation that I used and that reference you mentioned.

However, I think the difference is minor when we compare our method to others