huggingface / optimum-neuron

Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
Apache License 2.0
176 stars 51 forks

- new notebook - TGI + SageMaker + Mistral #551

Open samir-souza opened 3 months ago

samir-souza commented 3 months ago

What does this PR do?

This PR adds a new notebook that shows how to compile Mistral-7B and deploy it to a SageMaker endpoint on Inf2. I also updated the notebooks.mdx page with a link to the notebook.


HuggingFaceDocBuilderDev commented 1 month ago

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Thank you!

HuggingFaceDocBuilderDev commented 1 week ago

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Thank you!