Closed runame closed 6 months ago
We scale the loss by `1/C` explicitly, but the same factor is also applied implicitly for `MSELoss` and for `BCEWithLogitsLoss`. This was not caught by the tests because the expensive tests in `test_fisher.py` are skipped in the GitHub workflow.
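To illustrate how an implicit `1/C` can arise, here is a minimal plain-Python sketch. It assumes the loss uses mean reduction, which in PyTorch's `MSELoss` and `BCEWithLogitsLoss` averages over all elements of the batch and therefore already divides by the output dimension `C`. The helper names `mse_mean` and `mse_batch_mean` are hypothetical and are not taken from this repository.

```python
# Hypothetical stand-ins for a squared-error loss on an N x C batch.
# They mimic mean reduction by hand; this is not the repository's code.

def squared_error_sum(pred, target):
    """Sum of squared errors over every element of the batch."""
    return sum(
        (p - t) ** 2
        for row_p, row_t in zip(pred, target)
        for p, t in zip(row_p, row_t)
    )

def mse_mean(pred, target):
    """Mean over all N * C elements, so a 1/C factor is already included."""
    n, c = len(pred), len(pred[0])
    return squared_error_sum(pred, target) / (n * c)

def mse_batch_mean(pred, target):
    """Mean over the N samples only, with no 1/C factor."""
    return squared_error_sum(pred, target) / len(pred)

pred = [[1.0, 1.0, 1.0, 1.0], [2.0, 0.0, 0.0, 0.0]]
target = [[0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0]]
C = len(pred[0])

loss_mean = mse_mean(pred, target)        # already contains 1/C
batch_avg = mse_batch_mean(pred, target)  # contains no 1/C
double_scaled = loss_mean / C             # explicit 1/C on top of the implicit one

print(loss_mean, batch_avg / C, double_scaled)
```

In this sketch `batch_avg / C` matches `loss_mean`, while `double_scaled` is smaller by a further factor of `C`, which is the kind of mismatch described above.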