Open cyril-k opened 3 months ago
Related issue: aws-neuron/transformers-neuronx#71
Thanks for reporting the problem. We have a fix for this and will be releasing it along with a sample for this model in https://github.com/aws-neuron/aws-neuron-samples in the upcoming release.
When attempting to compile Mixtral-8x7B-Instruct-v0.1, I get the following error:
log contents relative to this error:
I used Deep Learning AMI Neuron (Ubuntu 22.04) 20240311 on inf2.24xlarge instance. I installed neuronx-cc and transformers-neuronx from source.
NeuronX Compiler version 2.12.68.0+4480452af Python version 3.10.12 HWM version 2.12.0.0-422c9037c NumPy version 1.25.2 transformers-neuronx version 0.9.20240321
Code to reproduce the error: Note: it is necessary to modify "sliding_window" from "null" to 4096 in the config.json in the model directory to reproduce this bug.