Closed szagoruyko closed 6 years ago
F.kl_div
Checked that this produces the same results as in the paper.
Fixes #18 and #7
F.kl_div
normalizes by the number of elements in tensor, fixed to normalize by minibatch sizeChecked that this produces the same results as in the paper.
Fixes #18 and #7