Open dgoldenberg-audiomack opened 3 years ago
Hi, is there any update on this issue? Thanks.
As a piece of general advice I strongly recommend starting with general deep learning resources or courses. They will have decent answers to all of you questions; none of this is specific to TensorFlow Recommenders.
I struggle to find a good explanation of the raw features -> vocabularies -> embeddings flow, but this is a reasonable article.
Thanks, @maciejkula. I'll check out the link. It feels like more developers will encounter the same questions; I think the library would benefit from having recipes/samples which address these.
numeric ratings
Looking into embeddings, what's not clear to me is, the TFRS models focus on two towers: query/users and items. Since ratings are tuples of { user ID, item ID, rating }, it's not clear what tower to graft them on. Might there need to be a "third tower"?
I think others may be looking for the same kinds of info as me.
Working through the featurization doc, questions arise (can't seem to find relevant info on SOF):
adapt
method. The Retrieval tutorial does not use adapt. How important is it to convert, for example, string user ID's into integers? Is this step a must? "During model training, the value of that vector is adjusted to help the model predict its objective better." -- does this mean that without the conversion to integers, accuracy of predictions will be a lot worse? If this step is done, would I still be able to use the string user ID's when looking up predictions, or do I need to use the integers? How does one go back and forth between the two forms?genre
to be treated as N times more important than for example the duration of the movie, how do I specify that? Or, do we leave it up to TF itself to learn what's more imporant?