facebookincubator / AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Apache License 2.0

Different results between AITemplate example and DreamStudio for SD 2.0 #108

Open Suhail opened 1 year ago

Suhail commented 1 year ago

Using a DDIM scheduler, I get different results from DreamStudio (screenshot: CleanShot 2022-11-27 at 12 30 22).

Is anyone else getting predictable results with the same seed and scheduler? This makes me think there may be a bug.

cc: @terrychenism

mvpatel2000 commented 1 year ago

Not sure if this is related, but I'm seeing significant differences, on the order of 1e-4, between the groupnorm implementation in AITemplate and the one in PyTorch. I'm not quite sure why, since the accumulation does happen in fp32. It could also be a different issue on my end, from misconfiguring AITemplate for what I'm doing.
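For scale, here is a minimal sketch of how much error fp16 rounding of the output alone can introduce into a group norm result, even when the statistics are accumulated at higher precision. It uses only the Python standard library and does not call AITemplate or PyTorch; `to_fp16`, `group_norm`, and `max_abs_diff` are hypothetical helper names, the affine scale and bias are omitted, and Python's double-precision floats stand in for the fp32 accumulation path. It is an illustration of magnitude, not a diagnosis of the difference reported above.

```python
import math
import random
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def group_norm(values, num_groups, eps=1e-5):
    """Reference group norm over a flat list of channels (no affine parameters)."""
    size = len(values) // num_groups
    out = []
    for g in range(num_groups):
        group = values[g * size:(g + 1) * size]
        mean = sum(group) / size
        var = sum((v - mean) ** 2 for v in group) / size
        inv_std = 1.0 / math.sqrt(var + eps)
        out.extend((v - mean) * inv_std for v in group)
    return out


def max_abs_diff(a, b):
    """Largest element-wise absolute difference between two sequences."""
    return max(abs(x - y) for x, y in zip(a, b))


random.seed(0)
# Inputs are made fp16-representable so both paths see identical data.
x = [to_fp16(random.gauss(0.0, 1.0)) for _ in range(1024)]

# Statistics and output kept at high precision.
reference = group_norm(x, num_groups=32)
# Same math, but the output is rounded to fp16 as a half-precision kernel would store it.
fp16_out = [to_fp16(v) for v in reference]

err = max_abs_diff(reference, fp16_out)
print(f"max abs diff from fp16 output rounding alone: {err:.2e}")
```

The spacing between adjacent fp16 values in [1, 2) is 2^-10 (about 9.8e-4), so rounding a normalized output of that size can move it by up to about 4.9e-4. When comparing against a PyTorch reference, it helps to check whether that reference was computed and stored in fp16 or fp32 before treating a 1e-4 gap as a bug.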

kamalkraj commented 1 year ago

https://github.com/facebookincubator/AITemplate/pull/114