PygmalionAI / aphrodite-engine

PygmalionAI's large-scale inference engine
https://pygmalion.chat
GNU Affero General Public License v3.0
606 stars 78 forks

feat: EETQ #408

Closed: AlpinDale closed this 1 month ago

AlpinDale commented 1 month ago

Very simple implementation; it doesn't work with tensor parallelism (TP) yet.

Example model here: https://huggingface.co/alpindale/Mistral-7B-Instruct-v0.2-EETQ
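
For context, a rough sketch of the general idea behind this kind of quantization, assuming EETQ is an INT8 weight-only scheme with one scale per output row. This is not the code in this PR and does not use the EETQ kernels. The names `quantize_rows_int8` and `dequantize_rows` are made up for illustration, and it is plain Python so it runs without dependencies.

```python
# Illustrative only: symmetric per-row INT8 weight-only quantization.
# Assumption: one float scale per output row; activations are left untouched.

def quantize_rows_int8(weight):
    """Return (int8-range integer rows, per-row scales) for a 2D list of floats."""
    q_rows, scales = [], []
    for row in weight:
        max_abs = max(abs(v) for v in row)
        # Map the largest magnitude in the row to 127; avoid dividing by zero.
        scale = max_abs / 127.0 if max_abs > 0 else 1.0
        q_rows.append([max(-127, min(127, round(v / scale))) for v in row])
        scales.append(scale)
    return q_rows, scales


def dequantize_rows(q_rows, scales):
    """Recover approximate float weights from quantized rows and their scales."""
    return [[q * s for q in row] for row, s in zip(q_rows, scales)]


weight = [[0.5, -1.0, 0.25], [2.0, 0.0, -0.5]]
q_rows, scales = quantize_rows_int8(weight)
restored = dequantize_rows(q_rows, scales)
```

Each restored weight differs from the original by at most half of its row's scale, which is the usual rounding bound for this kind of scheme.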