openppl-public / ppl.llm.serving

Apache License 2.0
122 stars 13 forks source link

[feature] support more cache layout and quant bit=0, quant group=1 for mlu #49

Closed Vincent-syr closed 8 months ago