Closed matthiasdiener closed 1 month ago
What's wrong with pocl 6? This is something I've seen, but it should not affect mirgecom.
We have seen substantial slowdowns in our prediction cases, caused both by the new type promotion code and by the move from pocl-5 to pocl-6 (1 rank, wall time per step [s]):
| smoke_test_ks_3d | Porter/GPU | Lassen/GPU | M1/CPU | CI-Linux/CPU |
|---|---|---|---|---|
| pocl5, old numpy type promo code | 0.17 | 0.18 | 0.6 | 1.05 |
| pocl6, old numpy type promo code | 0.58 | 0.47 | 0.65 | 1.41 |
| pocl5, new numpy type promo code | 0.21 | 0.19 | 6.5 | |
| pocl6, new numpy type promo code | 0.58 | 0.5 | 6.5 | 16.1 |
For some reason, pocl-6 seems to affect mostly the CUDA devices, while the type promotion code mostly affects the CPU runs.
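To make the two effects easier to compare, here is a minimal sketch that turns the timings above into slowdown factors relative to the "pocl5, old type promo" baseline. The numbers are copied from the table; the `timings` dict and the `slowdown` helper are only illustrative names and not part of mirgecom or pytato.

```python
# Wall time per step [s] for smoke_test_ks_3d, 1 rank, copied from the table.
# None marks the one measurement that is missing from the table.
PLATFORMS = ["Porter/GPU", "Lassen/GPU", "M1/CPU", "CI-Linux/CPU"]

timings = {
    ("pocl5", "old"): [0.17, 0.18, 0.6, 1.05],
    ("pocl6", "old"): [0.58, 0.47, 0.65, 1.41],
    ("pocl5", "new"): [0.21, 0.19, 6.5, None],
    ("pocl6", "new"): [0.58, 0.5, 6.5, 16.1],
}


def slowdown(config, baseline=("pocl5", "old")):
    """Return {platform: time(config) / time(baseline)}, None if data is missing."""
    result = {}
    for platform, t, base in zip(PLATFORMS, timings[config], timings[baseline]):
        if t is None or base is None:
            result[platform] = None
        else:
            result[platform] = round(t / base, 2)
    return result


# Effect of pocl-6 alone (type promotion code unchanged).
pocl_effect = slowdown(("pocl6", "old"))
# Effect of the new type promotion code alone (pocl version unchanged).
promo_effect = slowdown(("pocl5", "new"))

for name, effect in [("pocl6 vs pocl5", pocl_effect),
                     ("new vs old type promo", promo_effect)]:
    print(name)
    for platform, factor in effect.items():
        print(f"  {platform}: {'n/a' if factor is None else f'{factor}x'}")
```

With these numbers, pocl-6 alone is roughly a 3x slowdown on the GPU machines but close to 1x on M1/CPU, while the new type promotion code alone is roughly a 10x slowdown on M1/CPU and only a small change on the GPUs.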
Related: https://github.com/inducer/pytato/pull/538, https://github.com/inducer/pytato/pull/510
Questions for the review: