[DOC] Document which algorithms expect Fortran vs. C contiguous data

rapidsai / cuml

cuML - RAPIDS Machine Learning Library

Apache License 2.0

4.16k stars 526 forks source link

Opened a PR that should inform users when a possibly useless copy is performed. As stated here, data on host (Numpy arrays and Pandas dataframes) will be copied over to device anyways, cuDF dataframes are deepcopied too and cuDF series are 1D and thus not affected by the issue. Then only cuda array interface compliant arrays (and numba arrays) can be copied only because of data order/contiguousness change. This change should allow the user to be informed.

If the user is informed through logging, is it necessary to also document it? If so, should we add the expected data order/contiguousness on the documentation of each function parameter providing data everywhere in the entire library? What should we do when function parameters are left undocumented (many occurrences)?

rapidsai / cuml

[DOC] Document which algorithms expect Fortran vs. C contiguous data #5929