Open gabrielspmoreira opened 3 years ago
Issue #597 addresses incremental update of the statistics of numerical features
Hi @karlhigley , what's the expected release date of v0.9?
Any progress on this?
@BlakeB415 we dont have bandwidth to work on this feature now. So there is no progress on this feature.
Is your feature request related to a problem? Please describe. It is common in industry to train recommender systems models incrementally, i.e., taking a model trained with past data and fine-tuning with new data. In such cases, new values on existing categorical features need to be encoded as contiguous ids on top of the existing ones, whose embeddings will be appended to the pre-trained embedding tables.
Describe the solution you'd like NVTabular should support incremental pre-processing, by keeping the previous mapping between raw values and encoded values for categorical features (so that they match the position of the pre-trained embeddings), and assigning new values as contiguous item ids.