karpathy / llm.c

LLM training in simple, raw C/CUDA
MIT License
23.6k stars 2.64k forks source link

WIP Distribution Visualisation to help with FP8 work & beyond #618

Open ademeure opened 3 months ago

ademeure commented 3 months ago

Not ready for integration at all / still very hacky, bunch of unsolved issues I am not sure where code should go etc.:

... but...

It's pretty cool! :)

image