MatthewRHermes / mrh

MRH's research code
Other
18 stars 30 forks source link

Microcycle offload #98

Closed valay1 closed 3 months ago

valay1 commented 3 months ago

Bypassing CPU-optimized vj and vk construction to use native get_jk of GPU.

MatthewRHermes commented 3 months ago

point at "gpu" branch plz