Closed TiSU32 closed 6 years ago
Added layer norm to all of the arch models. Added model selection. Small change in nonlinearities.