huggingface / tgi-gaudi

Large Language Model Text Generation Inference on Habana Gaudi
http://hf.co/docs/text-generation-inference
Apache License 2.0

Upgrade to 2.3.1 #225

Open yuanwu2017 opened 2 months ago

yuanwu2017 commented 2 months ago

What does this PR do?

Fixes # (issue)

Before submitting

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.

mandy-li commented 1 month ago

@yuanwu2017, for testing, please add the torch.compile model to your test cases.

yuanwu2017 commented 1 month ago

> @yuanwu2017, for testing, please add the torch.compile model to your test cases.

Ok.

mandy-li commented 1 week ago

@yuanwu2017, please test whether this PR causes any performance regression for llama2, llama3.1, and llava-next.