Closed michaelbenayoun closed 1 month ago
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
You should unpin the safetensors package here because it now causes a dependency conflict: https://github.com/huggingface/optimum-neuron/blob/1e7d0f5ae47fd51b2418b1355a2e819e58b69890/text-generation-inference/server/pyproject.toml#L17
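A minimal sketch of how a pin mismatch like this could be spotted locally, assuming exact `==` pins such as the two quoted in this PR's description. The helper names (`parse_pin`, `installed_version`, `find_mismatches`) are hypothetical and not part of optimum-neuron. Versions are compared as plain strings, which is a simplification rather than full version-specifier handling.

```python
from importlib.metadata import PackageNotFoundError, version

# Pins quoted in this PR's description; everything else here is illustrative.
PINS = ["accelerate==0.29.2", "transformers==4.40.2"]


def parse_pin(pin):
    """Split an exact pin such as 'name==1.2.3' into (name, version)."""
    name, sep, pinned = pin.partition("==")
    if not sep or not name.strip() or not pinned.strip():
        raise ValueError(f"not an exact pin: {pin!r}")
    return name.strip(), pinned.strip()


def installed_version(name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(name)
    except PackageNotFoundError:
        return None


def find_mismatches(pins, lookup=installed_version):
    """Return {name: (pinned, found)} for each pin the environment does not satisfy."""
    mismatches = {}
    for pin in pins:
        name, pinned = parse_pin(pin)
        found = lookup(name)
        # Plain string equality: an assumption that is good enough for exact pins.
        if found != pinned:
            mismatches[name] = (pinned, found)
    return mismatches


if __name__ == "__main__":
    for name, (pinned, found) in find_mismatches(PINS).items():
        print(f"{name}: pinned {pinned}, found {found or 'not installed'}")
```

The `lookup` parameter exists only so the check can be exercised without touching the real environment, for example by passing a dictionary's `get` method.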
I fixed all but one test:
tests/generation/test_tnx_llama.py::test_decoder_generation_multiple_eos_token_ids
What does this PR do?
This PR synchronizes optimum-neuron with more recent transformers and accelerate versions:
- accelerate==0.29.2, which is the latest release at the time of writing,
- transformers==4.40.2, which will be the latest release when this PR is merged.

Related PR in transformers: https://github.com/huggingface/transformers/pull/30259

On top of that: