cmp-nct / ggllm.cpp

Falcon LLM ggml framework with CPU and GPU support

Add support for AMX instructions (bf16 and/or int8) #81

Status: Open. WilliamTambellini opened this issue 1 year ago

WilliamTambellini commented 1 year ago

Prerequisites

Your exact command line to replicate the issue

./falcon_main_avx512 -m falcon-40b-instruct.ggccv1.q4_k.bin -f g1.txt -n 100 --top-k 1
...
| Syst. Info | AVX | AVX2 | AVX512 | AVX512_VBMI | AVX512_VNNI | FMA | NEON | ARM_FMA | F16C | FP16_VA | SIMD | BLAS | SSE3 | VSX |
...

Environment and Context

https://www.intel.com/content/dam/develop/external/us/en/documents/architecture-instruction-set-extensions-programming-reference.pdf