shenweichen / DeepMatch

A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors which can be used for ANN search.
https://deepmatch.readthedocs.io/en/latest/
Apache License 2.0
2.19k stars 525 forks source link

preprocess数据集处理 #65

Closed Bradyzzhang closed 2 years ago

Bradyzzhang commented 2 years ago

Please refer to the FAQ in doc and search for the related issues before you ask the question.

Describe the question(问题描述) A clear and concise description of what the question is.

Additional context Add any other context about the problem here.

Operating environment(运行环境):

bbruceyuan commented 2 years ago

完全是可以加入的。比如 item 只要你构造一个类似的 item_profile 然后把相应的特征在构造样本的时候加入进行就OK了。

作者之所以没有这么做应该是有两个方面的考量:

  1. youtubednn item 侧只需要有 movie_id 特征就OK了,为了和 fm/dssm 够用一个 run_example, 所以没有加入相关特征
  2. 为了代码更简洁,反正这只是一个测试的demo。所以你想加item侧特征是OK的,比如item的平均分数。