alibaba / rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Apache License 2.0
544 stars 50 forks source link

Enable MHA parallel on Arm #107

Closed Reyfone closed 2 months ago