QDucasse / nn_benchmark

🧠 Benchmark facility to train networks on different datasets for PyTorch/Brevitas
MIT License
24 stars 1 forks source link

Bump onnxruntime from 1.2.0 to 1.5.2 #7

Closed dependabot-preview[bot] closed 3 years ago

dependabot-preview[bot] commented 3 years ago

Bumps onnxruntime from 1.2.0 to 1.5.2.

Release notes

Sourced from onnxruntime's releases.

ONNX Runtime v1.5.2

This is a minor patch release on 1.5.1 with the following changes:

ONNX Runtime v1.5.1

Key Updates

General

  • Reduced Operator Kernel build allows ORT binaries to be built with only required operators in the model(s) - learn more
  • [Preview] ORT for Mobile Platforms - minimizes build size for mobile and embedded devices - learn more
  • Transformer model inferencing performance optimizations
    • Perf improvement for DistilBERT
    • Benchmark tool supports more pretrained models
  • Improvements in quantization tool
    • Support quantization-aware training models
    • Make calibration tool to support general preprocessing and calibrate on input
    • Simplify the quantization APIs
    • Support of model larger than 2G
  • New operators for static quantization: QLinearMul, QLinearAdd, QlinearSigmoid and QLinearLeakyRelu
  • Prepack constant matrix B for float GEMM (MatMul, Attention)
  • Limited Python 3.8 support added in addition to 3.5-3.7 for official Python packages. Not yet supported for Windows GPU and Linux ARM builds.
  • Telemetry enabled in Java and NodeJS packages for Windows builds. Note: data is not directly sent to Microsoft or ORT teams by ONNX Runtime; enabling telemetry means trace events are collected by the Windows operating system and may be sent to the cloud based on the user's privacy settings - learn more.

API

  • Python API support for RegisterCustomOpsLibrary
  • IO Binding API for C/C++/C# language bindings. This allows use of pre-allocated buffers on targeted devices and also target device for unknown output shapes.
  • Sharing of allocators between multiple sessions. This allows much better utilization of memory by not creating a separate arena for each session in the same process. See this for details.

Windows ML

  • NuGet package now supports UWP applications targeting Windows Store deployment (CPU only)
  • NuGet package now supports .NET and .NET framework applications
  • RUST Developers can now deploy Windows ML – sample and documentation available here
  • New APIs to for additional performance control:
    • IntraopNumThreads: Provides an ability to change the number of threads used in the threadpool for Intra Operator Execution for CPU operators through LearningModelSessionOptions.
    • SetNamedDimensionOverrides: Provides the ability to override named input dimensions to concrete values through LearningModelSessionOptions in order to achieve better runtime performance.
  • Support for additional ONNX format image type denotations – Gray8, normalized [0..1] and normalized [-1..1]
  • Reduced Windows ML package size by separating debug symbols into separate distribution package.

Execution Providers

  • CUDA updates
    • CUDA 10.2 / cuDNN 8.0 in official package
    • CUDA 11 support added and available to build from source
    • CUDA conv kernel support asymmetrical padding to fully support models such as YoloV3 for improved GPU perf
  • TensorRT EP updates
    • Support for TensorRT 7.1
Commits


Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) - `@dependabot use these labels` will set the current labels as the default for future PRs for this repo and language - `@dependabot use these reviewers` will set the current reviewers as the default for future PRs for this repo and language - `@dependabot use these assignees` will set the current assignees as the default for future PRs for this repo and language - `@dependabot use this milestone` will set the current milestone as the default for future PRs for this repo and language - `@dependabot badge me` will comment on this PR with code to add a "Dependabot enabled" badge to your readme Additionally, you can set the following in your Dependabot [dashboard](https://app.dependabot.com): - Update frequency (including time of day and day of week) - Pull request limits (per update run and/or open at any time) - Out-of-range updates (receive only lockfile updates, if desired) - Security updates (receive only security updates, if desired)
dependabot-preview[bot] commented 3 years ago

Superseded by #12.