NVIDIA-Merlin / Merlin

NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.
Apache License 2.0

[RMP] Cross-framework model evaluation metrics #407

Open karlhigley opened 2 years ago

karlhigley commented 2 years ago

Problem statement

Merlin models should let you train models from different libraries (TF, PT, lightfm, xgboost, implicit, etc.) and then compare them in a consistent, apples-to-apples way. Right now each library has its own evaluation metrics, which aren't directly comparable to those of other frameworks. During the paper experiments, for instance, we had to implement custom metric code for implicit to match the output from TF. Customers also want to be able to compare against their own metrics.
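To make the "apples-to-apples" idea concrete, here is a minimal sketch of framework-agnostic metrics. It is not Merlin's API: the function names (`top_k_items`, `recall_at_k`, `ndcg_at_k`) and the input layout (one list of per-item scores per user, plus one list of relevant item ids per user) are assumptions made for illustration. The only premise is that every framework's predictions can be exported as plain scores, so the same metric code runs on all of them.

```python
import math


def top_k_items(scores, k):
    """Return the indices of the k highest-scoring items, best first.

    sorted() is stable, so ties are broken in favor of the lower item index.
    """
    return sorted(range(len(scores)), key=lambda i: -scores[i])[:k]


def recall_at_k(all_scores, all_relevant, k):
    """Mean recall@k over users; users with no relevant items are skipped."""
    recalls = []
    for scores, relevant in zip(all_scores, all_relevant):
        if not relevant:
            continue
        hits = len(set(top_k_items(scores, k)) & set(relevant))
        recalls.append(hits / len(relevant))
    return sum(recalls) / len(recalls) if recalls else 0.0


def ndcg_at_k(all_scores, all_relevant, k):
    """Mean NDCG@k with binary relevance; users with no relevant items are skipped."""
    ndcgs = []
    for scores, relevant in zip(all_scores, all_relevant):
        if not relevant:
            continue
        relevant = set(relevant)
        ranked = top_k_items(scores, k)
        # Discounted gain: a hit at 0-based rank r contributes 1 / log2(r + 2).
        dcg = sum(1.0 / math.log2(rank + 2)
                  for rank, item in enumerate(ranked) if item in relevant)
        # Ideal ordering places every relevant item at the top of the list.
        ideal = sum(1.0 / math.log2(rank + 2)
                    for rank in range(min(len(relevant), len(ranked))))
        ndcgs.append(dcg / ideal if ideal > 0 else 0.0)
    return sum(ndcgs) / len(ndcgs) if ndcgs else 0.0


# Hypothetical scores exported from any framework: rows are users, columns are items.
example_scores = [[0.9, 0.1, 0.5], [0.2, 0.8, 0.3]]
example_relevant = [[0], [2]]
print(recall_at_k(example_scores, example_relevant, k=2))
print(ndcg_at_k(example_scores, example_relevant, k=2))
```

Keeping the metric code independent of any framework's tensor type means that tie-breaking, handling of users without relevant items, and the discount formula are defined once instead of once per library, which is where differences between frameworks' built-in metrics tend to creep in.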

Goals

Constraints

Starting Point

Notes

Tasks

(Groundwork for cross-framework evaluation)

( Enter the goal here )

viswa-nvidia commented 2 years ago

@karlhigley @jperez999, could you please update this ticket with the problem, goal, and constraints? I believe you are laying a lot of the groundwork in 22.07; could you please add that information here? Let me know if you are running into any difficulties.

karlhigley commented 2 years ago

Duplicate of NVIDIA-Merlin/models#450

karlhigley commented 1 year ago

Closing https://github.com/NVIDIA-Merlin/models/issues/450 instead

viswa-nvidia commented 1 year ago

Based on a discussion with Karl, the tasks are not yet fully listed.

EvenOldridge commented 1 year ago

@marcromeyn, can you review and add any comments or concerns?

viswa-nvidia commented 1 year ago

@marcromeyn, please review the definition and add tasks for 22.11 and 22.12, specifically the starting point.

viswa-nvidia commented 1 year ago

@marcromeyn, please add information about the PoC to the ticket.