aws-neuron / aws-neuron-sdk

Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and integrated with your favorite AWS services
https://aws.amazon.com/machine-learning/neuron/
Other
420 stars 136 forks source link

Added inf2 instance type #896

Open jdwisman opened 1 month ago

jdwisman commented 1 month ago

Description:

MANDATORY: PR needs test run output

Test Run Output: Please specify the release version, instance size and type, OS type and test output.

Training tutorial: Convergence graph for training tutorials Performance metrics average_throughput, latency_p50, latency_p99 and MFU% if available

Please make sure this PR contains correct classification terms (Alpha, Beta, and Stable).

If possible, provide your results or a link to them for the reviewer to check your work.

Issue #, sim, or t.corp if available:

Link to RTD for my changes: https://awsdocs-neuron-staging.readthedocs-hosted.com/en/YOUR_BRANCH_NAME/

Additional context:

PR Checklist

Pytest Marker Checklist

(Coming soon...)

Reviewer Checklist

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.