intel / neural-speed

An innovative library for efficient LLM inference via low-bit quantization
https://github.com/intel/neural-speed
Apache License 2.0
350 stars, 38 forks

XeTLA: enable BF16 tile_t init #319

Status: Closed (DDEle closed this 4 months ago)

DDEle commented 4 months ago

Type of Change: Feature

API not changed

Description

As the title says: enable BF16 initialization of tile_t in XeTLA.

Expected Behavior & Potential Risk

N/A

How has this PR been tested?

Covered by IPEX internal tests.

Dependency Change?

N/A

DDEle commented 4 months ago

Closing this PR: the change has already landed through the internal IPEX PR, which has been merged.