Open andreadasilvabaudet opened 1 year ago
Hello,
Can you give us more information about the failing model?
If this is something we can attempt to reproduce on our end we will see if we can provide a fix. Ideally you can provide an open source model (or inline code) where this behavior is reproducible.
facing same issue when trying to compile vae encoder of sd 2.1 inpainting model, rest all parts of the model i am able to compile but not vae encoder . Using inf2.8xlarge
Same problem here, any solution to this so far?
@jluntamazon I can provide a minimal example if that helps
Hi recog-arch, The Stable Diffusion model is not supported on our inf1 architecture. However we have done some work to get it runnable on our inf2 architecture. See https://awsdocs-neuron.readthedocs-hosted.com/en/latest/general/benchmarks/inf2/inf2-performance.html?highlight=stable%20diff So, as a next step I would suggest try compiling your model on that instance.
Still investigating this one on inf2. Using the following test code:
import torch_neuronx
import torch
from diffusers import StableDiffusionInpaintPipeline
pipe = StableDiffusionInpaintPipeline.from_pretrained(
"stabilityai/stable-diffusion-2-inpainting",
torch_dtype=torch.float32,
)
decoder = pipe.vae.decoder
decoder_in = torch.randn([1,4,64,96])
decoder_neuron = torch_neuronx.trace(
decoder, decoder_in, compiler_workdir="decoder_compile", compiler_args="--verbose info"
)
Please let us know if this does not represent the problem use case.
Hi team,
I'm trying to compile a model using a inf1.6xlarge instance, but the compilation gets stuck and it does not finish. These are the last lines of the compilation logs:
I'm compiling using the following script:
Thank you in advance.