huggingface / optimum-neuron

Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
Apache License 2.0
176 stars 51 forks source link

Extend TGI integration tests #561

Closed dacorvo closed 2 months ago

dacorvo commented 2 months ago

What does this PR do?

This extends the TGI integration tests by running all tests on not only llama but also gpt2 model configurations.