Closed philschmid closed 3 months ago
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository
What this PR do?
This PR adds a new TGI neuron version to properly support Llama 3 instruct on inferentia.