alibaba / rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Apache License 2.0
544 stars 50 forks source link

[CPU] add implementation for GEMM and token embedding #101

Closed wenhuanh closed 2 months ago