NVIDIA-Merlin / Merlin

NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.
Apache License 2.0
785 stars 118 forks source link

[RMP] Performant large embedding table support #733

Open EvenOldridge opened 2 years ago

EvenOldridge commented 2 years ago

Problem:

Goal:

New Functionality

Constraints:

Architectural consideration

NA

Starting Point:

Model Parallel Support

~Feature engineering that reduces embedding size~

~Reduced Precision Support~

Not storing user embeddings

Inference Support

Serving

## Example
karlhigley commented 2 years ago

This looks good! Two questions:

bschifferer commented 2 years ago

As an success criteria, we need to have benchmarks for each of the point above:

Customer ask us the questions and if we need to answer them, if we provide the functionality. Only if we add run the experiments, we can ensure that the implementation is correct.

viswa-nvidia commented 2 years ago

@marcromeyn , please define this ticket and also create another ticket for SOK

viswa-nvidia commented 1 year ago

@EvenOldridge , please help to define this ticket

viswa-nvidia commented 1 year ago

@edknv , please check with HCTR team and confirm milestone