microsoft / semantic-kernel

Integrate cutting-edge LLM technology quickly and easily into your apps
https://aka.ms/semantic-kernel
MIT License
22.12k stars 3.3k forks source link

.Net: Bump Microsoft.ML.OnnxRuntimeGenAI from 0.4.0 to 0.5.1 in /dotnet #9744

Closed dependabot[bot] closed 1 week ago

dependabot[bot] commented 1 week ago

Bumps Microsoft.ML.OnnxRuntimeGenAI from 0.4.0 to 0.5.1.

Release notes

Sourced from Microsoft.ML.OnnxRuntimeGenAI's releases.

v0.5.1

Release Notes

In addition to the features in the 0.5.0 release, this release adds:

  • Add ability to choose provider and modify options at runtime
  • Fixed data leakage bug with KV caches

Features in 0.5.0:

  • Support for MultiLoRA
  • Support for multi-frame for Phi-3 vision and Phi-3.5 vision models
  • Support for the Phi-3 MoE model
  • Support for NVIDIA Nemotron model
  • Support for the Qwen model
  • Addition of the Set Terminate feature, which allows users to cancel mid-generation
  • Soft capping support for Group Query Attention
  • Extend quantization support to embedding and LM head layers
  • Mac support in published packages

Known issues

  • Models running with DirectML do not support batching
  • Python 3.13 is not supported in this release

v0.5.0

Release Notes

  • Support for MultiLoRA
  • Support for multi-frame for Phi-3 vision and Phi-3.5 vision models
  • Support for the Phi-3 MoE model
  • Support for NVIDIA Nemotron model
  • Support for the Qwen model
  • Addition of the Set Terminate feature, which allows users to cancel mid-generation
  • Soft capping support for Group Query Attention
  • Extend quantization support to embedding and LM head layers
  • Mac support in published packages

Known issues

  • Models running with DirectML do not support batching
  • Python 3.13 is not supported in this release
Commits
  • e8cd6bc Update Phi-3 vision example and add Phi-3.5 vision example (#1049)
  • 83ddc3d Fix up projection bias calculation (#1059)
  • cc4577e Fix data leakage in KV cache initialization (#1057)
  • f66e4f5 Add ability to modify provider options at runtime (#1048)
  • 95c43f1 Automatically calculate perplexity metrics for model supportability (#1008)
  • 2f2686f Reading from wrong variable in model config for enable_mem_pattern (#1047)
  • dc9bc02 Change Login Method Azure - Packaging Pipelines (#1044)
  • 8965fed Add Multi-Image Support with phi-3.5 in chat app demo (#1036)
  • 4bf84d8 Fix Nuget Validation Stage (#1039)
  • 9138a0a Include prompt templates in genai_config.json in model builder (#1020)
  • Additional commits viewable in compare view


Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
dmytrostruk commented 1 week ago

Any versions of the Onnx package after 0.4.0 needs to be updated properly handle the resources, or will fail with the message below when the application finishes. I'm holding this update, for the moment.

@RogerBarreto we can fix the version to 0.4.0 to avoid automatic version updates if needed.

dependabot[bot] commented 1 week ago

OK, I won't notify you again about this release, but will get in touch when a new version is available. If you'd rather skip all updates until the next major or minor version, let me know by commenting @dependabot ignore this major version or @dependabot ignore this minor version. You can also ignore all major, minor, or patch releases for a dependency by adding an ignore condition with the desired update_types to your config file.

If you change your mind, just re-open this PR and I'll resolve any conflicts on it.