Closed oleksandr-pavlyk closed 2 weeks ago
Deleted rendered PR docs from intelpython.github.com/dpctl, latest should be updated shortly. :crossed_fingers:
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_145 ran successfully. Passed: 894 Failed: 1 Skipped: 119
Build with DPCTL_TARGET_CUDA=ON
succeeds, test suite run on RTX 3050 passed.
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_149 ran successfully. Passed: 894 Failed: 1 Skipped: 119
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_158 ran successfully. Passed: 894 Failed: 1 Skipped: 119
This line and the initial sorted_block_size
value seem to be unused.
static constexpr size_t determine_automatically = 0;
size_t sorted_block_size =
(sort_nelems >= 512) ? 512 : determine_automatically;
True or not, in merge_sort_detail::sort_over_work_group_contig_impl
the sorted_block_size
is set to the same value with no check against the current value of sorted_block_size
. So these lines can seemingly be removed entirely.
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_174 ran successfully. Passed: 894 Failed: 1 Skipped: 119
Nit: renaming of sort.cpp
/sort.hpp
and argsort.cpp
/argsort.hpp
to merge_sort
and merge_argsort
may be good for clarity.
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_188 ran successfully. Passed: 894 Failed: 1 Skipped: 119
@ndgrigorian I have pushed changes to address all the issues highlighted in review.
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_189 ran successfully. Passed: 895 Failed: 0 Skipped: 119
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_193 ran successfully. Passed: 895 Failed: 0 Skipped: 119
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_194 ran successfully. Passed: 894 Failed: 1 Skipped: 119
@antonwolfy Thank you for the suggestion. I am going to act on it in a separate PR
This PR implements radix sort, and exposes it via new
kind
keyword ofdpt.sort
anddpt.argsort
functions. Supported values ofkind
keyword are"stable"
,"radixsort"
, and"mergesort"
, with the default value beingNone
(same as"stable"
). The"stable"
kind uses radix sort for boolean and short integral types,