Closed: LarrySnyder closed this 3 weeks ago
Old code is faster -- why?
Some of the slowdown seems to come from loss_function. Try pre-computing the standard normal loss values and then calculating the non-standard normal loss from those.
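A rough sketch of that idea, not the project's actual code: the names `standard_normal_loss` and `normal_loss` are hypothetical, and the "pre-computing" here is done with a simple cache, which is only one way to do it. It relies on the standard identity that for X ~ N(mean, sd^2), the loss E[(X - x)^+] equals sd * L(z) with z = (x - mean) / sd, where L(z) = phi(z) - z * (1 - Phi(z)) is the standard normal loss.

```python
import math
from functools import lru_cache


@lru_cache(maxsize=None)
def standard_normal_loss(z):
    """Standard normal loss L(z) = phi(z) - z * (1 - Phi(z)).

    Hypothetical helper; results are cached so repeated z values
    are computed only once.
    """
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
    return pdf - z * (1.0 - cdf)


def normal_loss(x, mean, sd):
    """Loss E[(X - x)^+] for X ~ N(mean, sd^2).

    Standardizes x and rescales the cached standard normal loss,
    rather than recomputing the pdf and cdf for each (mean, sd) pair.
    """
    z = (x - mean) / sd
    return sd * standard_normal_loss(z)
```

The cache only helps when the same z values recur (for example, a fixed grid of service levels evaluated across many nodes). If the z values are mostly distinct, a vectorized computation over an array of z values would probably be a better fit.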
loss_function
duplicate of #160