pulp-platform / quantlib

A library to train and deploy quantised Deep Neural Networks
Apache License 2.0
17 stars 7 forks source link

Add PACTRMSNorm and SiLU Replacement Pass #13

Closed Victor-Jung closed 7 months ago

Victor-Jung commented 7 months ago

This PR adds a new operator and two replacement passes that will be useful for deploying llama-based transformer models.

Added