Open tchaton opened 1 year ago
Unfortunately not all kernels run well on SM75 GPUs. Check this readme: https://fburl.com/pimcs20r.
Hey @ipiszy. Thanks for answering. This page seems protected under internal Meta login.
I have several questions for you ?
I currently have access to T4, A10 and V100.
We did fully test on Ampere GPUs ( A100), for most of kernels Turing GPUs(T4) should work well
@terrychenism. Great to know. Do you know which kernels could have issues on T4 and how to go about debugging them ?
Any chance you could test this out on T4 more deeply too ? T4 are quite cheap GPUs for users to run their inference upon. A100 are quite high end and less accessible.
Hello, I am having a similar issue. I am using an A100 as reported by nvidia-smi:
Fri Dec 23 21:47:17 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 520.61.05 Driver Version: 520.61.05 CUDA Version: 11.8 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA A100-SXM... On | 00000000:00:06.0 Off | 0 |
| N/A 26C P0 49W / 400W | 0MiB / 81920MiB | 0% Default |
| | | Disabled |
+-------------------------------+----------------------+----------------------+
And tried compiling https://huggingface.co/Red54/waifu-diffusion-v1-3-5-tainted with commit https://github.com/facebookincubator/AITemplate/commit/3625c3056041963926ba455347d9091fb891b872 due to the latest version not working well with pre-2.X SD models #133
Using the following prompt: Anime girl, Beautiful, masterpiece, Extremely Delicate Unity CG 8K-Wallpaper, Extremely Delicate Pixiv 8K-Illustration, Best Quality, Hyper Detailed, Intricate Details, Limited Palette, Photographic Incandescents, [Depth Of Field, Bokeh Effect], Focus On Character, Critical Angle, High_Quality ++Pretty Girl
Generates this image:
Which certainly has nothing to do with the prompt. The only modifications I made is changing "Runwayml/Stable-diffusion-v1-5" to "Red54/waifu-diffusion-v1-3-5-tainted"
@terrychenism Any updates or future plans to validate T4 works well ?
@terrychenism Any updates or future plans to validate T4 works well ?
cc @ipiszy
@tchaton I don't have access to T4 gpu. Could you please run the model and localize the ops which are not supported on T4?
@terrychenism Unfortunately, the model inference works as everything compiled properly. But the generated images are random noise. So I can't identify which operation aren't properly compiled.
@ipiszy Can you give me access to fburl.com/pimcs20r ?
@ipiszy Can you give me access to fburl.com/pimcs20r ?
@tchaton This link resolves to https://github.com/facebookincubator/AITemplate#installation , if that helps
Hey there,
I am trying run AITemplate Stable diffusion examples on T4 GPU.
I have tried with the same package version as described in the README using master and this branch: https://github.com/facebookincubator/AITemplate/pull/74/commits/d62f0773eac88623846c192858e6028288054043
I am just getting crappy images.
Would it be possible for you to benchmark and validate the model work on T4 GPU ?
Best, T.C