Reduced Operator Kernel build allows ORT binaries to be built with only required operators in the model(s) - learn more
[Preview] ORT for Mobile Platforms - minimizes build size for mobile and embedded devices - learn more
Transformer model inferencing performance optimizations
Perf improvement for DistilBERT
Benchmark tool supports more pretrained models
Improvements in quantization tool
Support quantization-aware training models
Make calibration tool to support general preprocessing and calibrate on input
Simplify the quantization APIs
Support of model larger than 2G
New operators for static quantization: QLinearMul, QLinearAdd, QlinearSigmoid and QLinearLeakyRelu
Prepack constant matrix B for float GEMM (MatMul, Attention)
Limited Python 3.8 support added in addition to 3.5-3.7 for official Python packages. Not yet supported for Windows GPU and Linux ARM builds.
Telemetry enabled in Java and NodeJS packages for Windows builds. Note: data is not directly sent to Microsoft or ORT teams by ONNX Runtime; enabling telemetry means trace events are collected by the Windows operating system and may be sent to the cloud based on the user's privacy settings - learn more.
API
Python API support for RegisterCustomOpsLibrary
IO Binding API for C/C++/C# language bindings. This allows use of pre-allocated buffers on targeted devices and also target device for unknown output shapes.
Sharing of allocators between multiple sessions. This allows much better utilization of memory by not creating a separate arena for each session in the same process. See this for details.
Windows ML
NuGet package now supports UWP applications targeting Windows Store deployment (CPU only)
NuGet package now supports .NET and .NET framework applications
RUST Developers can now deploy Windows ML – sample and documentation available here
New APIs to for additional performance control:
IntraopNumThreads: Provides an ability to change the number of threads used in the threadpool for Intra Operator Execution for CPU operators through LearningModelSessionOptions.
SetNamedDimensionOverrides: Provides the ability to override named input dimensions to concrete values through LearningModelSessionOptions in order to achieve better runtime performance.
Support for additional ONNX format image type denotations – Gray8, normalized [0..1] and normalized [-1..1]
Reduced Windows ML package size by separating debug symbols into separate distribution package.
Execution Providers
CUDA updates
CUDA 10.2 / cuDNN 8.0 in official package
CUDA 11 support added and available to build from source
CUDA conv kernel support asymmetrical padding to fully support models such as YoloV3 for improved GPU perf
TensorRT EP updates
Support for TensorRT 7.1
Commits
ad7cc54 Skip running onnx tests in python mac os pipeline (#5416)
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
- `@dependabot use these labels` will set the current labels as the default for future PRs for this repo and language
- `@dependabot use these reviewers` will set the current reviewers as the default for future PRs for this repo and language
- `@dependabot use these assignees` will set the current assignees as the default for future PRs for this repo and language
- `@dependabot use this milestone` will set the current milestone as the default for future PRs for this repo and language
- `@dependabot badge me` will comment on this PR with code to add a "Dependabot enabled" badge to your readme
Additionally, you can set the following in your Dependabot [dashboard](https://app.dependabot.com):
- Update frequency (including time of day and day of week)
- Pull request limits (per update run and/or open at any time)
- Out-of-range updates (receive only lockfile updates, if desired)
- Security updates (receive only security updates, if desired)
Bumps onnxruntime from 1.2.0 to 1.5.2.
Release notes
Sourced from onnxruntime's releases.
Commits
ad7cc54
Skip running onnx tests in python mac os pipeline (#5416)a6f0923
Fix for CUDA build break90e132c
Cherry-pick 5378c6c4bab
Move flatbuffers to 1.12 release (#5392)bcbcb25
update onnx-tensorrt submodule (#5442)8206ff4
Support trilinear sampling in Resize CPU and CUDA kernels (#5300)e4f71ab
Exclude GPT2_LM_HEAD from OpenVino's model test list (#5356)d77241e
Fix WinML warnings (#5228)4227dd7
Revert "Move flatbuffers to 1.12 release (#5392)"fc0fc80
Revert "Add flatbuffers verifier for ORT format buffer (#5378)"Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting
@dependabot rebase
.Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) - `@dependabot use these labels` will set the current labels as the default for future PRs for this repo and language - `@dependabot use these reviewers` will set the current reviewers as the default for future PRs for this repo and language - `@dependabot use these assignees` will set the current assignees as the default for future PRs for this repo and language - `@dependabot use this milestone` will set the current milestone as the default for future PRs for this repo and language - `@dependabot badge me` will comment on this PR with code to add a "Dependabot enabled" badge to your readme Additionally, you can set the following in your Dependabot [dashboard](https://app.dependabot.com): - Update frequency (including time of day and day of week) - Pull request limits (per update run and/or open at any time) - Out-of-range updates (receive only lockfile updates, if desired) - Security updates (receive only security updates, if desired)