Closed by azrael417 1 year ago
@eqy, @rmhaskarnvidia please review this PR and/or suggest someone to review. I will also take a look, but I am not familiar with cudnn-frontend.
@azrael417, we can defer the issues I brought up to a later PR if @crcrpar is content to merge the fix, given the urgency.
This PR fixes the single-node group batch norm in APEX to work with CUDA 12.2 and RTC.