huggingface / optimum-intel

🤗 Optimum Intel: Accelerate inference with Intel optimization tools
https://huggingface.co/docs/optimum/main/en/intel/index
Apache License 2.0
364 stars 101 forks source link

Fix gpt_bigcode generation #681

Closed IlyasMoutawwakil closed 3 months ago

IlyasMoutawwakil commented 3 months ago

What does this PR do?

This PR adds gpt_bigcode specific reordering of pask_key_values

Before submitting

HuggingFaceDocBuilderDev commented 3 months ago

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.