NVIDIA-Merlin / NVTabular

NVTabular is a feature engineering and preprocessing library for tabular data, designed to quickly and easily manipulate terabyte-scale datasets used to train deep-learning-based recommender systems.
Apache License 2.0
1.05k stars, 143 forks

Remove use of DataFrame merge from `test_embedding_cat_export_import` #1853

Closed oliverholworthy closed 1 year ago

oliverholworthy commented 1 year ago

Remove use of DataFrame merge from test_embedding_cat_export_import

DataFrame merges in cuDF result in non-deterministic row ordering, which was causing this test to fail on certain devices (e.g. P100 GPUs).
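As a rough illustration of the problem, and not the actual change made in this PR, the sketch below shows two ways a test can build expected values without depending on the row order a merge returns. It uses pandas so that it runs without a GPU. The column names (`id`, `emb_0`, `emb_1`) and the data are hypothetical, and the same pattern is assumed to carry over to cuDF's pandas-like API.

```python
import numpy as np
import pandas as pd

# Hypothetical data: categorical ids as they appear in the dataset, and a
# lookup table of pretrained embedding vectors keyed by the same id.
ids = pd.Series([3, 1, 2, 1], name="id")
emb_table = pd.DataFrame(
    {"id": [1, 2, 3], "emb_0": [0.1, 0.2, 0.3], "emb_1": [1.0, 2.0, 3.0]}
)

# Fragile pattern: compare against the merge output row by row. This only
# works if the join returns rows in the order of the left frame.
#   merged = ids.to_frame().merge(emb_table, on="id", how="left")

# Option 1: skip the merge. Index the table by id and look rows up in the
# order of `ids`, so the output order is fixed by the input order.
expected = emb_table.set_index("id").loc[ids.to_numpy()].to_numpy()

# Option 2: keep the merge, but carry an explicit row number through it and
# sort on that column afterwards to restore a known order.
left = ids.to_frame().reset_index().rename(columns={"index": "row"})
restored = (
    left.merge(emb_table, on="id", how="left")
    .sort_values("row")[["emb_0", "emb_1"]]
    .to_numpy()
)

# Both options yield one embedding row per input id, in input order.
assert np.array_equal(expected, restored)
```

Option 1 avoids the join entirely, which matches the spirit of the PR title. Option 2 is an alternative for cases where the merge has to stay.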

github-actions[bot] commented 1 year ago

Documentation preview

https://nvidia-merlin.github.io/NVTabular/review/pr-1853