intel / neural-speed

An innovative library for efficient LLM inference via low-bit quantization
https://github.com/intel/neural-speed
Apache License 2.0
350 stars 38 forks source link

[DOC]update README #301

Closed intellinjun closed 5 months ago

intellinjun commented 5 months ago

Type of Change

feature or bug fix or documentation or others API changed or not

Description

update README