RFC: Keras Recommenders (Keras-RS)

We have used TensorFlow Recommenders (TFRS) to generate sets of recommended content for users. We are mostly optimizing for user clicks, so we treat the problem as binary classification: we want to present users with content that they will click. Here are some capabilities that are important to us:

Leverage rich user, context, and item attributes in models.
Simple feature preprocessing.
Easily evaluate and compare model performance in terms of ROC-AUC, top-K categorical accuracy, and any other metrics suitable for evaluating recommendation engines.
Leverage transfer learning (e.g. training retrieval and ranking models simultaneously).
Take negatives into account in retrieval. (The standard retrieval models in TFRS are trained only on positive examples, so they never learn from explicit negatives.)
Easily examine feature importances (we currently use SHAP and learned weight matrices).
Easily deploy and test models in production that seamlessly incorporate the feature preprocessing.
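
To make the point about negatives in retrieval concrete, here is a minimal sketch of a softmax loss that scores one positive item against explicitly supplied negatives (for example, items that were shown but not clicked). It is plain Python, not TFRS or Keras-RS API; the function name `softmax_loss_with_negatives` and the use of dot-product scores over fixed embeddings are assumptions made for illustration.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def softmax_loss_with_negatives(query, positive, negatives):
    """Cross-entropy loss for one query embedding.

    The positive item competes against the explicit negatives in a softmax
    over dot-product scores. Hypothetical helper, for illustration only.
    """
    logits = [dot(query, positive)] + [dot(query, n) for n in negatives]
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    # Negative log-probability assigned to the positive item (index 0).
    return log_z - logits[0]
```

With this loss, a negative that the query scores highly is penalized more than one it already scores low, which is the signal a model trained on positives alone does not receive.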

The primary challenge we have faced with TFRS is that the deployed models do not scale well. We have a custom multi-model endpoint that first invokes the retrieval model, then cross joins the retrieved items with the users, and then invokes the ranking model. The endpoint times out or runs out of memory when we invoke it with millions of queries.
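
For readers who want a picture of the retrieve-then-rank flow described above, the following is a rough sketch that keeps memory bounded by processing users in fixed-size batches and pairing each user only with that user's own retrieved candidates. It is not the endpoint described in this comment: `retrieve` and `rank` are hypothetical callables standing in for the two deployed models, and `batch_size` and `top_n` are illustrative parameters.

```python
from itertools import islice


def batched(iterable, size):
    """Yield successive lists of at most `size` items."""
    it = iter(iterable)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


def recommend(user_ids, retrieve, rank, batch_size=1024, top_n=10):
    """Yield (user_id, ranked_items), one batch of users at a time.

    retrieve: list of users -> list of candidate lists (hypothetical).
    rank: list of (user, item) pairs -> list of scores (hypothetical).
    """
    for users in batched(user_ids, batch_size):
        candidates = retrieve(users)
        # Pair each user only with that user's candidates, not with all items.
        pairs = [(u, item) for u, items in zip(users, candidates) for item in items]
        scores = rank(pairs)
        offset = 0
        for u, items in zip(users, candidates):
            user_scores = scores[offset:offset + len(items)]
            offset += len(items)
            ranked = sorted(zip(user_scores, items), key=lambda t: t[0], reverse=True)
            yield u, [item for _, item in ranked[:top_n]]
```

Because `recommend` is a generator, only one batch of (user, item) pairs exists in memory at a time, so peak memory depends on `batch_size` times the number of candidates per user rather than on the total number of queries.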


RFC: Keras Recommenders (Keras-RS) #20080
