Closed tmabraham closed 1 year ago
with everything enabled (tf32, gradient checkpointing, attention slicing, xformers memory-efficient attention), speeds up single GPU performance by pretty much 2x.
with everything enabled (tf32, gradient checkpointing, attention slicing, xformers memory-efficient attention), speeds up single GPU performance by pretty much 2x.