jafioti / luminal

Deep learning at the speed of light.
https://luminalai.com
Apache License 2.0
1.45k stars 90 forks source link

Mistral Q8 Cuda invalid memory access #43

Closed jafioti closed 4 months ago

jafioti commented 6 months ago

Unclear why this occurs, seems to happen upon dropping a buffer in the copyfromdevice op, though it might just be that that was the last op to run before other buffers were dropped.

This only started happening after adding q8, didn't happen on llama which uses fp16