pulp-platform / quantlib

A library to train and deploy quantised Deep Neural Networks
Apache License 2.0
17 stars 7 forks source link

Llama Support #15

Closed Victor-Jung closed 6 months ago

Victor-Jung commented 7 months ago

This PR adds support for quantizing Llama-flavour transformers. It provides new operators, related passes as well as various fixes.

Added