Open mbazzani opened 1 year ago
On pseudo branch, cross entropy loss is very high, ~200 on par with BCEWL, but typical is expected to be about 4
On pseudo branch, cross entropy loss is very high, ~200 on par with BCEWL, but typical is expected to be about 4