NVIDIA-Merlin / NVTabular

NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Apache License 2.0

[DOC] Examples of using NVT tags and selectors #1296

Open karlhigley opened 2 years ago

karlhigley commented 2 years ago

Is your feature request related to a problem? Please describe.
We now have the ability to select columns from schemas using tags while constructing NVT workflows, but don't yet have documentation of how to use this feature.

Describe the solution you'd like
Create a new example that explains how to use schemas and tags.

Describe alternatives you've considered

Additional context
This functionality was created for integration with Transformers4Rec, which I think does have some examples.
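
To make the request concrete, here is a minimal sketch of what "selecting columns from a schema using tags" means. It is a toy model in plain Python and does not call NVTabular or Merlin: the `Tag`, `Column`, and `ToySchema` names, the `select_by_tag` method, and the column names are all hypothetical and chosen for illustration only. The real example should use the library's own schema, tag, and selector classes.

```python
from dataclasses import dataclass, field
from enum import Enum


class Tag(Enum):
    # Hypothetical tag names for this sketch; not the library's tag set.
    CATEGORICAL = "categorical"
    CONTINUOUS = "continuous"
    TARGET = "target"


@dataclass(frozen=True)
class Column:
    # A column is a name plus the set of tags attached to it.
    name: str
    tags: frozenset = field(default_factory=frozenset)


@dataclass
class ToySchema:
    columns: list

    def select_by_tag(self, *tags):
        """Return a new schema holding columns that carry at least one of `tags`."""
        wanted = set(tags)
        return ToySchema([c for c in self.columns if c.tags & wanted])

    @property
    def names(self):
        return [c.name for c in self.columns]


# Hypothetical dataset columns, tagged once when the schema is defined.
schema = ToySchema([
    Column("user_id", frozenset({Tag.CATEGORICAL})),
    Column("item_id", frozenset({Tag.CATEGORICAL})),
    Column("price", frozenset({Tag.CONTINUOUS})),
    Column("click", frozenset({Tag.TARGET})),
])

# Downstream steps ask for columns by role instead of hard-coding names.
cat_cols = schema.select_by_tag(Tag.CATEGORICAL).names
cont_cols = schema.select_by_tag(Tag.CONTINUOUS).names
features = schema.select_by_tag(Tag.CATEGORICAL, Tag.CONTINUOUS).names
```

The point of the pattern is that a workflow step can request "all categorical columns" and keep working when columns are added or renamed, as long as the tags are kept up to date.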

bschifferer commented 2 years ago

I don't think we need an extra example for the Column Tagging API. I integrated the feature into the Scaling Criteo example.