Open llewelld opened 1 month ago
Python is doing a lot of heavy lifting and hiding a lot of the complexity. It'd be interesting to compare the Python GPT code with Kaparthy's pure C/CUDA implementaton of the same:
https://github.com/karpathy/llm.c
There's a text-based walkthrough from Karpathy in the discussions on the repo:
https://github.com/karpathy/llm.c/discussions/481
It might be interesting to convert this to use SYCL, which could potentially be even easier and it doesn't look like it's been done yet.
Python is doing a lot of heavy lifting and hiding a lot of the complexity. It'd be interesting to compare the Python GPT code with Kaparthy's pure C/CUDA implementaton of the same:
https://github.com/karpathy/llm.c
There's a text-based walkthrough from Karpathy in the discussions on the repo:
https://github.com/karpathy/llm.c/discussions/481