alibaba/rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Apache License 2.0
544 stars
50 forks
Enable MHA parallel on Arm #107
Closed
Reyfone closed 2 months ago