microsoft / OpenCLOn12

The OpenCL-on-D3D12 mapping layer
MIT License
104 stars 13 forks source link

Improve specialization handling of the parallel compiles for the same kernel #7

Open jenatali opened 3 years ago

jenatali commented 3 years ago

Right now we can end up with multiple threads trying to specialize the same expensive kernel, when it'd be more efficient to just let one do it while the others wait. Need to either introduce per-kernel locks or other async/await mechanism.