NVTabular is a feature engineering and preprocessing library for tabular data, designed to quickly and easily manipulate terabyte-scale datasets used to train deep-learning-based recommender systems.
Apache License 2.0
Remove use of DataFrame merge from `test_embedding_cat_export_import` #1853
DataFrame merges in cuDF do not guarantee a deterministic row order, which was causing problems for this test on certain devices (e.g. P100 GPUs).
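
To illustrate the kind of order dependence described above, here is a minimal sketch of an order-independent comparison. It is not the code from this PR: it uses pandas as a CPU stand-in for cuDF, and the column names (`cat_id`, `emb_0`), the sample values, and the helper `same_rows` are all hypothetical.

```python
import pandas as pd

# Hypothetical data: category ids and one embedding column exported for them.
expected = pd.DataFrame(
    {"cat_id": [0, 1, 2, 3], "emb_0": [0.1, 0.2, 0.3, 0.4]}
)

# The same rows in a different order, standing in for a result whose
# row order is not guaranteed (as with an unordered merge).
actual = expected.iloc[[2, 0, 3, 1]].reset_index(drop=True)


def same_rows(left, right, key):
    """Compare two frames after sorting both by `key` (hypothetical helper)."""
    left_sorted = left.sort_values(key).reset_index(drop=True)
    right_sorted = right.sort_values(key).reset_index(drop=True)
    return left_sorted.equals(right_sorted)


# A positional comparison depends on row order.
order_sensitive = expected.equals(actual)

# Sorting by the key first makes the comparison independent of row order.
order_insensitive = same_rows(expected, actual, "cat_id")

print(order_sensitive, order_insensitive)
```

A test written this way compares content rather than position, so it should not depend on how a given device happens to order the rows.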