XeTLA enable BF16 tile_t Init

intel / neural-speed

An innovative library for efficient LLM inference via low-bit quantization

https://github.com/intel/neural-speed

Apache License 2.0

350 stars 38 forks source link

Closed DDEle closed 4 months ago

DDEle commented 4 months ago

API not changed

As title.

N/A

IPEX internal tests

N/A

DDEle commented 4 months ago

Merged as internal IPEX PR merged.