Bump onnxruntime from 1.2.0 to 1.5.2

Bumps onnxruntime from 1.2.0 to 1.5.2.

Release notes

ONNX Runtime v1.5.2

This is a minor patch release on 1.5.1 with the following changes:

Remove dependency on cudnn64_7.dll for GPU C# nuget: microsoft/onnxruntime#5386

Add config keys header file in the packages for Linux and Mac: microsoft/onnxruntime#5388

Add flatbuffers verifier for ORT format buffer: microsoft/onnxruntime#5378

Use official flatbuffers v1.12: microsoft/onnxruntime#5392

Mitigate pybind11 build break using Xcode 12 on macOS: microsoft/onnxruntime#5381

Support trilinear sampling in the Resize operator: microsoft/onnxruntime#5300

Update TensorRT parser to fix accuracy issue in some opset11 models: microsoft/onnxruntime#5442

ONNX Runtime v1.5.1

Key Updates

General

Reduced Operator Kernel build allows ORT binaries to be built with only required operators in the model(s) - learn more

[Preview] ORT for Mobile Platforms - minimizes build size for mobile and embedded devices - learn more

Transformer model inferencing performance optimizations

Perf improvement for DistilBERT

Benchmark tool supports more pretrained models

Improvements in quantization tool

Support quantization-aware training models

Make calibration tool to support general preprocessing and calibrate on input

Simplify the quantization APIs

Support of model larger than 2G

New operators for static quantization: QLinearMul, QLinearAdd, QlinearSigmoid and QLinearLeakyRelu

Prepack constant matrix B for float GEMM (MatMul, Attention)

Limited Python 3.8 support added in addition to 3.5-3.7 for official Python packages. Not yet supported for Windows GPU and Linux ARM builds.

Telemetry enabled in Java and NodeJS packages for Windows builds. Note: data is not directly sent to Microsoft or ORT teams by ONNX Runtime; enabling telemetry means trace events are collected by the Windows operating system and may be sent to the cloud based on the user's privacy settings - learn more.

API

Python API support for RegisterCustomOpsLibrary

IO Binding API for C/C++/C# language bindings. This allows use of pre-allocated buffers on targeted devices and also target device for unknown output shapes.

Sharing of allocators between multiple sessions. This allows much better utilization of memory by not creating a separate arena for each session in the same process. See this for details.

Windows ML

NuGet package now supports UWP applications targeting Windows Store deployment (CPU only)

NuGet package now supports .NET and .NET framework applications

RUST Developers can now deploy Windows ML – sample and documentation available here

New APIs to for additional performance control:

IntraopNumThreads: Provides an ability to change the number of threads used in the threadpool for Intra Operator Execution for CPU operators through LearningModelSessionOptions.

SetNamedDimensionOverrides: Provides the ability to override named input dimensions to concrete values through LearningModelSessionOptions in order to achieve better runtime performance.

Support for additional ONNX format image type denotations – Gray8, normalized [0..1] and normalized [-1..1]

Reduced Windows ML package size by separating debug symbols into separate distribution package.

Execution Providers

CUDA updates

CUDA 10.2 / cuDNN 8.0 in official package

CUDA 11 support added and available to build from source

CUDA conv kernel support asymmetrical padding to fully support models such as YoloV3 for improved GPU perf

TensorRT EP updates

Support for TensorRT 7.1

Commits

ad7cc54 Skip running onnx tests in python mac os pipeline (#5416)
a6f0923 Fix for CUDA build break
90e132c Cherry-pick 5378
c6c4bab Move flatbuffers to 1.12 release (#5392)
bcbcb25 update onnx-tensorrt submodule (#5442)
8206ff4 Support trilinear sampling in Resize CPU and CUDA kernels (#5300)
e4f71ab Exclude GPT2_LM_HEAD from OpenVino's model test list (#5356)
d77241e Fix WinML warnings (#5228)
4227dd7 Revert "Move flatbuffers to 1.12 release (#5392)"
fc0fc80 Revert "Add flatbuffers verifier for ORT format buffer (#5378)"
Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) - `@dependabot use these labels` will set the current labels as the default for future PRs for this repo and language - `@dependabot use these reviewers` will set the current reviewers as the default for future PRs for this repo and language - `@dependabot use these assignees` will set the current assignees as the default for future PRs for this repo and language - `@dependabot use this milestone` will set the current milestone as the default for future PRs for this repo and language - `@dependabot badge me` will comment on this PR with code to add a "Dependabot enabled" badge to your readme Additionally, you can set the following in your Dependabot [dashboard](https://app.dependabot.com): - Update frequency (including time of day and day of week) - Pull request limits (per update run and/or open at any time) - Out-of-range updates (receive only lockfile updates, if desired) - Security updates (receive only security updates, if desired)

QDucasse / nn_benchmark

Bump onnxruntime from 1.2.0 to 1.5.2 #7

ONNX Runtime v1.5.2

ONNX Runtime v1.5.1

Key Updates

General

API

Windows ML

Execution Providers