huggingface / optimum-neuron

Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
Apache License 2.0
176 stars 51 forks source link

Update cache guide to including the caching for traced models #557

Open JingyaHuang opened 2 months ago

JingyaHuang commented 2 months ago

What does this PR do?

Add doc

Before submitting

HuggingFaceDocBuilderDev commented 2 months ago

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

HuggingFaceDocBuilderDev commented 1 month ago

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Thank you!

HuggingFaceDocBuilderDev commented 3 weeks ago

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Thank you!

michaelbenayoun commented 3 weeks ago

What's the status on that? @JingyaHuang do you need our reviews?

JingyaHuang commented 3 weeks ago

@michaelbenayoun I will need to update it a bit since there are some changes. Will ask for another review once done!

HuggingFaceDocBuilderDev commented 2 days ago

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Thank you!