Open dswah opened 8 years ago
Is this computation necessary? can we leverage a previous matrix product?
there are lots of implicit Gamma computations in the check descent loop. these should be cached if possible!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
FIX THIS!! the coord descent loop is so wasteful >:0 !!!
they show up in grad_wrt_theta and maybe more places