ztachip opened this issue 4 weeks ago (status: Open)
Do you have an equivalent simple C implementation of an LLM, but for inference with LLaMA models? I am trying to build an FPGA accelerator for LLMs, and simple reference C code would be very helpful. Thanks.
Maybe this: https://github.com/karpathy/llama2.c? It is a one-file implementation of Llama 2 inference in C.
This is perfect. Thanks