Hi KhalilGuetari, the `Unknown opcode for unpickling at ...` error seems to come from the PyTorch framework code. I've redirected your request to an internal team that may be more familiar with that code.
Hello,
I am struggling to trace the BLIP2 model from the transformers library using `torch_neuronx` to make it work on an inf2. The model I want to trace is the XXL version, but that one doesn't fit on one Neuron device. So I want to start by tracing a smaller model, the flan-t5-xl version, on an inf2.8xlarge.
The tracing actually completes, but an error is raised when loading the traced model.
Trace Model Code:
Inference Code
Error Message
I also ran `torch_neuronx.analyze()` for more information:
How should I trace the model? It uses T5 to generate text, so maybe the problem comes from that part. Could you please provide help or guidance on this? Thanks!
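For readers following along, the trace, save, and load round trip described in this post can be sketched roughly as below. This is not the code from the post: `TinyModel` is a hypothetical stand-in rather than BLIP2, the file name is made up, and the fallback to `torch.jit.trace` is an assumption added only so the sketch also runs on a machine without Neuron devices. On an inf2 instance the `torch_neuronx.trace` branch would be taken instead.

```python
import os
import tempfile

import torch

try:
    # Assumption: torch_neuronx is only installed on Neuron (inf2/trn1) hosts.
    import torch_neuronx
    trace = torch_neuronx.trace
except ImportError:
    # CPU stand-in so the sketch can run anywhere; not a Neuron compile.
    trace = torch.jit.trace


class TinyModel(torch.nn.Module):
    """Hypothetical placeholder model; the post traces BLIP2 instead."""

    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(4, 2)

    def forward(self, x):
        return torch.relu(self.linear(x))


model = TinyModel().eval()
example = torch.rand(1, 4)

# Step 1: trace the model with an example input.
traced = trace(model, example)

with tempfile.TemporaryDirectory() as workdir:
    path = os.path.join(workdir, "tiny_traced.pt")  # made-up file name

    # Step 2: serialize the traced module as TorchScript.
    torch.jit.save(traced, path)

    # Step 3: load it back. This is the step where the post reports an error.
    loaded = torch.jit.load(path)

# The reloaded module should reproduce the traced module's output.
roundtrip_ok = torch.allclose(traced(example), loaded(example))
print("round trip matches:", roundtrip_ok)
```

If a small model like this survives the round trip in the same environment while the BLIP2 artifact does not, that at least suggests the problem is tied to the larger model's saved file rather than to the save/load calls themselves.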