tensorflow / recommenders

TensorFlow Recommenders is a library for building recommender system models using TensorFlow.
Apache License 2.0

How to use a pretrained embedding as a feature in the user model #654

Open zhifeng-huang opened 1 year ago

zhifeng-huang commented 1 year ago

Hello,

I have a signature embedding of size 256 for each user. How can I use this vector as a feature in the user model?

Any tips are appreciated.

caesarjuly commented 1 year ago

You can use these embeddings as the initial weights for your user embedding layer. For example (assuming `from tensorflow.keras import layers`; `num_users` and `pretrained_matrix` are placeholders for your vocabulary size and your precomputed embedding array):

        embedding_layer = layers.Embedding(
            input_dim=num_users,          # number of users in the vocabulary
            output_dim=256,               # dimensionality of the pretrained vectors
            weights=[pretrained_matrix],  # NumPy array of shape (num_users, 256)
            trainable=True,               # set False to freeze the pretrained weights
        )
ddofer commented 1 year ago

Is this compatible with the examples in the tutorials that use StringLookup? Can it be combined with pretrained embedding extractors (like BERT) dynamically, or only if all embeddings are extracted in advance? Can we combine that with the tutorial examples, e.g. the retrieval task? Thanks!

caesarjuly commented 1 year ago

@ddofer This is a different approach vs. the string lookup in the tutorial.

  1. In this approach, you extract the embeddings beforehand, so you can use any kind of pretrained embedding, including BERT.
  2. The string-lookup approach generates one unique embedding for each user or movie ID, trained dynamically with the model.
  3. If you want to embed BERT models in the retrieval, you can leverage other libraries, import the BERT layers into your model structure, and then train them dynamically with your retrieval task.
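For contrast, a minimal sketch of the string-lookup approach from the tutorials (point 2), where each ID gets its own randomly initialized embedding that is trained with the model (`movie_ids` and the embedding size 32 are hypothetical):

```python
import tensorflow as tf

movie_ids = ["m1", "m2", "m3"]

# Each ID gets a randomly initialized, trainable embedding row; nothing is
# precomputed, and the vectors are learned entirely during training.
movie_model = tf.keras.Sequential([
    tf.keras.layers.StringLookup(vocabulary=movie_ids, mask_token=None),
    tf.keras.layers.Embedding(len(movie_ids) + 1, 32),  # +1 for the OOV index
])

out = movie_model(tf.constant(["m1", "m3"]))
print(out.shape)  # (2, 32)
```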
ddofer commented 1 year ago

Great, thanks! I've added that to my retrieval model using strong, large (384-dim) pretrained embeddings. Oddly, the results are the same (despite this being a sparse problem for users and items).

caesarjuly commented 1 year ago

This question can't be answered without more details; the result can depend on multiple factors.

  1. What kind of pretrained embeddings are you using?
  2. How sparse is the data?
  3. How much data do you have, and how complex is the model?
  4. Are the pretrained embeddings fine-tuned dynamically with the model?
  5. What features do you already have?
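On point 4, the difference is just the `trainable` flag on the embedding layer; a minimal sketch with a hypothetical 384-dim pretrained matrix:

```python
import numpy as np
import tensorflow as tf

# Hypothetical 384-dim pretrained vectors for 10 items.
pretrained = np.random.rand(10, 384).astype("float32")

# trainable=False keeps the pretrained vectors fixed; trainable=True lets the
# optimizer fine-tune them together with the rest of the model.
frozen = tf.keras.layers.Embedding(10, 384, weights=[pretrained], trainable=False)
tuned = tf.keras.layers.Embedding(10, 384, weights=[pretrained], trainable=True)

frozen(tf.constant([0, 1]))  # call once so the layers are built
tuned(tf.constant([0, 1]))

print(len(frozen.trainable_weights), len(tuned.trainable_weights))  # 0 1
```

If the frozen variant performs the same as random initialization, that suggests the pretrained signal isn't being used effectively; fine-tuning (or at least checking that the ID-to-row alignment is correct) is a reasonable next diagnostic.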