Decrease GPU buffer allocations using custom heap

praeclarum / webgpu-torch

Tensor computation with WebGPU acceleration

MIT License

576 stars 15 forks source link

Closed praeclarum closed 1 year ago

praeclarum commented 1 year ago

The heap does give a perf increase for small objects but:

The garbage collector isn't aggressive enough to free buffers from the heap
Kernels end up sharing input and output buffers and this makes the readonly_storage hard to enforce.

I'm keeping the code, but disabling by default.