triton-inference-server / fastertransformer_backend

BSD 3-Clause "New" or "Revised" License
411 stars 133 forks source link

int8 support for gptj&gptneox #151

Open rahuan opened 1 year ago