chengzeyi / stable-fast

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
MIT License
1.16k stars 71 forks source link

Dev #76

Closed chengzeyi closed 9 months ago

chengzeyi commented 9 months ago

optimize triton group norm

fix unused using warning