intel / xFasterTransformer

Apache License 2.0

[Models/Layers/Kernels] Add Baichuan1/2 full-link bf16 support & Fix next-tok gen bug #407

Closed (closed by abenmao 2 months ago)