Open MarcoBFreitas opened 2 months ago
I shared wrongly the Env:
Platform:
- Platform: Linux-5.15.0-1053-aws-x86_64-with-glibc2.29
- Python version: 3.8.10
Python packages:
- `optimum-neuron` version: 0.0.22.dev0
- `neuron-sdk` version: 2.18.0
- `optimum` version: 1.18.1
- `transformers` version: 4.36.2
- `huggingface_hub` version: 0.20.3
- `torch` version: 1.13.1+cu117
- `aws-neuronx-runtime-discovery` version: 2.9
- `libneuronxla` version: 0.5.971
- `neuronx-cc` version: 2.13.66.0+6dfecc895
- `neuronx-distributed` version: 0.7.0
- `neuronx-hwm` version: 2.12.0.0+422c9037c
- `torch-neuronx` version: 1.13.1.1.14.0
- `torch-xla` version: 1.13.1+torchneurone
- `transformers-neuronx` version: 0.10.0.21
Neuron Driver:
WARNING: apt does not have a stable CLI interface. Use with caution in scripts.
aws-neuronx-collectives/unknown,now 2.20.11.0-c101c322e amd64 [installed,upgradable to: 2.20.22.0-c101c322e]
aws-neuronx-dkms/unknown,now 2.15.9.0 amd64 [installed,upgradable to: 2.16.7.0]
aws-neuronx-oci-hook/unknown,now 2.2.45.0 amd64 [installed,upgradable to: 2.3.0.0]
aws-neuronx-runtime-lib/unknown,now 2.20.22.0-1b3ca6425 amd64 [installed]
aws-neuronx-tools/unknown,now 2.17.0.0 amd64 [installed,upgradable to: 2.17.1.0]
Can you work around this issue adding this flag --use_xser=False
to the run command?
System Info
Who can help?
@michaelbenayoun
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction (minimal, reproducible, runnable)
Reproduced as in https://huggingface.co/docs/optimum-neuron/en/tutorials/fine_tune_llama_7b
Run optimum-cli neuron consolidate dolly_llama/tensor_parallel_shards dolly_llama through instance terminal
Expected behavior
Expected successfull consolidation into safetensors.