Closed ndgrigorian closed 2 weeks ago
@vtavana I've made a change that fixes the copy issue. Please test it out.
I will want to refactor the code a bit to reduce redundancy before this is merged, but this resolved the problem for me locally.
Deleted rendered PR docs from intelpython.github.com/dpctl, latest should be updated shortly. :crossed_fingers:
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_198 ran successfully. Passed: 895 Failed: 0 Skipped: 119
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_198 ran successfully. Passed: 895 Failed: 0 Skipped: 119
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_200 ran successfully. Passed: 894 Failed: 1 Skipped: 119
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_203 ran successfully. Passed: 894 Failed: 1 Skipped: 119
The relevant tests for dpnp.pad
have now successfully passed with these updates.
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_204 ran successfully. Passed: 894 Failed: 1 Skipped: 119
This PR resolves https://github.com/IntelPython/dpctl/issues/1887
When using sub-group loads and stores, certain alignment of pointers is required. Copies to C-contiguous memory were not properly checking alignment, which would lead to incorrect results.
Before, using the example in #1887:
with this change: