Closed RuyiLian closed 3 years ago
Hi,
No problem! Originally the Welford's algorithm estimates the variance as: But we can rewrite:
However, this adds an increasingly small value due to the factor t, which is problematic when t gets large. The code makes use of sum of squares of differences from the mean to alleviate this problem. The paper equation and the one in the code end up doing the same, but the code implementation is a more numerically stable method of computing the standard deviation.
Hope it helps!
Thanks for your reply! It really helps a lot!
Hi,
Thanks for your great work. I just have a question about Eq.7 in your paper
In your implementation (
losses/covweighting_loss.py
) you use:I don't understand how this corresponds to the original Eq. 7. Could you please explain it? Thanks!