Dropped CentOS 7 support. All Linux binaries now require glibc version >=2.28, but users can still build the source code for a lower glibc version.
Added CUDA 12 packages for Python and NuGet.
Added Python 3.12 packages for ONNX Runtime Inference. ONNX Runtime Training Python 3.12 packages cannot be provided at this time since training packages depend on PyTorch, which does not support Python 3.12 yet.
Linux binaries (except those in AMD GPU packages) are built in a more secure way that is compliant with BinSkim's default policy (e.g., the binaries no longer have an executable stack).
Added support for Windows ARM64X for users who build ONNX Runtime from source. No prebuilt package is provided yet.
Removed Windows ARM32 binaries from official packages. Users who still need these binaries can build them from source.
Added an AMD GPU package with ROCm and MIGraphX (Python + Linux only).
Split the ONNX Runtime GPU NuGet package into two packages.
When building the source code for Linux ARM64 or Android, the C/C++ compiler must support BFloat16. Support for Android NDK 24.x has been removed. Please use NDK 25.x or 26.x instead.
Link time code generation (LTCG or LTO) is now disabled by default when building from source. To re-enable it, users can add "--enable_lto" to the build command. All prebuilt binaries are still built with LTO.
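For readers wondering whether a given Linux host can still use the prebuilt binaries after the glibc 2.28 requirement, here is a minimal sketch of a check. The helper names (`parse_version`, `meets_glibc_minimum`, `check_current_platform`) are hypothetical and not part of ONNX Runtime; only the standard-library call `platform.libc_ver()` is used, and it reports the C library the Python interpreter itself was linked against, which is assumed here to match the system glibc.

```python
import platform


def parse_version(text: str) -> tuple[int, ...]:
    """Turn a dotted version string such as '2.28' into a tuple of ints."""
    return tuple(int(part) for part in text.split(".") if part.isdigit())


def meets_glibc_minimum(version: str, minimum: tuple[int, int] = (2, 28)) -> bool:
    """Return True when `version` is at least `minimum` (2.28 per the note above)."""
    parsed = parse_version(version)
    return bool(parsed) and parsed >= minimum


def check_current_platform() -> str:
    """Describe whether the running interpreter's C library meets the requirement."""
    lib, version = platform.libc_ver()
    if lib != "glibc" or not version:
        # Hypothetical fallback message: covers non-Linux hosts, musl, static builds.
        return "glibc not detected (non-Linux, musl, or statically linked Python)"
    if meets_glibc_minimum(version):
        return f"glibc {version}: meets the 2.28 minimum for prebuilt Linux binaries"
    return f"glibc {version}: older than 2.28, consider building from source"


if __name__ == "__main__":
    print(check_current_platform())
```

On a CentOS 7 host, which ships glibc 2.17, `meets_glibc_minimum("2.17")` is false, which matches the dropped support described above.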
Core
Optimized graph inlining.
Added support for supplying a custom logger at the session level.
Performance
Added 4-bit quantization support on NVIDIA GPU and ARM64.
EPs
TensorRT EP
Added support for directly loading precompiled TensorRT engines and for a customizable engine prefix.
Added Python support for TensorRT plugins via ORT custom ops.
Fixed concurrent Session::Run bugs.
Updated calls to deprecated TensorRT APIs (e.g., enqueue_v2 → enqueue_v3).
Fixed various memory leak bugs.
QNN EP
Added support for QNN SDK 2.18.
Added context binary caching and model initialization optimizations.
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- `@dependabot show ignore conditions` will show all of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
Bumps onnxruntime-web from 1.14.0 to 1.17.0.
Release notes
Sourced from onnxruntime-web's releases.
... (truncated)
Commits
- `5f0b62c` [ORT 1.17.0 Release] Cherry-pick Final Round (#19327)
- `3fd94a8` [ORT 1.17.0 Release] Cherry pick 1st round (#19243)
- `daafe63` cherry pick qnn sdk 2.18 updates into release branch (#19197)
- `a63b71e` Cherry-pick "Fix buildJava from Zip-Nuget-Java-Nodejs Packaging Pipeline (#19...
- `80f274c` Fix SkipLayerNormalization shape inference (#18724)
- `e2e488d` Revert "iOS packaging pipeline stability" (#19135)
- `c92f72e` Merge Linux Nuget GPU pipeline with zip-nuget (#19120)
- `8d4369b` Update DirectML nuget version to 1.13.1 (#19122)
- `1bab989` [WebNN EP] Fixed bug in int8 data type processing (#19134)
- `9dee543` fix gemm beta for fp16 (#19153)