Force inline_weights_to_neff=True for SDXL's UNet for now so that optimum-neuron can be bumped to Neuron SDK 2.18. To be removed once the issue is patched by the Annapurna team.
Before submitting
[ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
[ ] Did you make sure to update the documentation with your changes?
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
What does this PR do?
Issue reported here: https://github.com/aws-neuron/aws-neuron-sdk/issues/859
Force `inline_weights_to_neff=True` for SDXL's UNet for now so that optimum-neuron can be bumped to Neuron SDK 2.18. To be removed once the issue is patched by the Annapurna team.
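As a rough illustration of the workaround described above, the override could look something like the sketch below. This is not the actual optimum-neuron code: `resolve_inline_weights`, `build_export_options`, `FORCED_INLINE_SUBMODELS`, and the sub-model names are hypothetical and only show the idea of forcing the flag for SDXL's UNet while leaving the user's choice untouched everywhere else.

```python
# Hypothetical sketch: none of these names come from optimum-neuron itself.

# Sub-models for which weights must be inlined until the SDK issue is patched.
FORCED_INLINE_SUBMODELS = {"unet"}


def resolve_inline_weights(submodel_name: str, requested: bool, is_sdxl: bool) -> bool:
    """Return the effective inline_weights_to_neff value for one sub-model."""
    if is_sdxl and submodel_name in FORCED_INLINE_SUBMODELS:
        # Temporary override: ignore the requested value for SDXL's UNet.
        return True
    return requested


def build_export_options(submodel_names, inline_weights_to_neff: bool, is_sdxl: bool) -> dict:
    """Build per-sub-model export options with the override applied."""
    return {
        name: {
            "inline_weights_to_neff": resolve_inline_weights(
                name, inline_weights_to_neff, is_sdxl
            )
        }
        for name in submodel_names
    }


if __name__ == "__main__":
    options = build_export_options(
        ["text_encoder", "unet", "vae_decoder"],
        inline_weights_to_neff=False,
        is_sdxl=True,
    )
    for name, opts in options.items():
        print(name, opts)
```

Keeping the override in a single helper would make it easy to delete once the upstream fix lands.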