Closed: lileilai closed this issue 2 years ago.
That latency is typical of the first run. You should warm up with at least 5 iterations, then measure latency by averaging the next 50 iterations (this is what our latency benchmark code does).
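The warmup-then-average procedure can be sketched as a small timing helper. This is a generic illustration, not the repo's actual benchmark code: `benchmark` and the placeholder workload are hypothetical names, and for real GPU measurements you would wrap the timed region with `torch.cuda.synchronize()`.

```python
import time

def benchmark(fn, warmup=5, iters=50):
    """Return the average latency of fn() in seconds,
    discarding the first `warmup` runs."""
    for _ in range(warmup):
        fn()  # warmup: triggers lazy init, JIT, cache fills
    start = time.perf_counter()
    for _ in range(iters):
        fn()
    return (time.perf_counter() - start) / iters

# Placeholder workload; replace with your pipeline call,
# e.g. lambda: pipe(prompt). For CUDA, call
# torch.cuda.synchronize() before reading the clock.
latency = benchmark(lambda: sum(range(10_000)))
print(f"avg latency: {latency * 1e3:.3f} ms")
```

Averaging only post-warmup iterations excludes one-time costs (engine deserialization, kernel autotuning, memory allocation) that dominate the first run.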
You can also compare against our Stochasticx CLI deployment, which already supports TensorRT and AITemplate on A100; follow our instructions (here). The commands to deploy TensorRT/AITemplate on your machine are:
I have tried the process described in this code repo for TensorRT, but I cannot reproduce the TensorRT latency on A100: my fp16 result is about 3.2, higher than the number you posted.