sony / model_optimization

Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.
https://sony.github.io/model_optimization/
Apache License 2.0
331 stars 53 forks source link

Substitution/scaled dot product attention #1229

Closed yarden-yagil-sony closed 1 month ago

yarden-yagil-sony commented 2 months ago

Pull Request Description:

Checklist before requesting a review: