linkedin / detext

DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks
BSD 2-Clause "Simplified" License

Add embedding and MLP support for sparse wide features #24

Closed StarWang closed 4 years ago

StarWang commented 4 years ago

Description

Currently, DeText's handling of sparse features has limited modeling power:

  1. only a linear model is applied to sparse features
  2. there is no interaction between sparse features and dense features (model_score = dense_score + sparse_score)

This PR resolves the above limitations on sparse features by

  1. computing a dense representation (embedding) of the sparse features
  2. allowing interactions between sparse features and dense features

More specifically, the model architecture changes from

```
dense_score  = dense_ftrs  -> MLP
sparse_score = sparse_ftrs -> Linear
final_score  = dense_score + sparse_score
```

to

```
sparse_emb_ftrs = sparse_ftrs -> Dense(sp_emb_size)
all_ftrs        = (dense_ftrs, sparse_emb_ftrs) -> Concatenate
final_score     = all_ftrs -> MLP
```
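The before/after wiring above can be sketched as a minimal NumPy forward pass. This is an illustration only, not DeText's actual implementation: the dimensions, the `mlp` helper, and the random weights standing in for learned parameters are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
num_dense, num_sparse, sp_emb_size, hidden = 4, 10, 3, 8

dense_ftrs = rng.normal(size=(2, num_dense))    # batch of 2 examples
sparse_ftrs = rng.normal(size=(2, num_sparse))  # densified sparse inputs

def mlp(x):
    """One hidden layer + linear output; weights are random stand-ins."""
    W1 = rng.normal(size=(x.shape[1], hidden))
    W2 = rng.normal(size=(hidden, 1))
    return np.tanh(x @ W1) @ W2

# Before: linear score on sparse features, added to the dense MLP score.
# The two feature groups never interact before the final sum.
sparse_score = sparse_ftrs @ rng.normal(size=(num_sparse, 1))
old_score = mlp(dense_ftrs) + sparse_score

# After: embed sparse features, concatenate with dense features,
# and score everything with one shared MLP so the groups can interact.
sparse_emb_ftrs = sparse_ftrs @ rng.normal(size=(num_sparse, sp_emb_size))
all_ftrs = np.concatenate([dense_ftrs, sparse_emb_ftrs], axis=1)
new_score = mlp(all_ftrs)
```

The key design change is that the sparse embedding feeds into the same MLP as the dense features, so cross-feature interactions are learned rather than scores being summed independently.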

Type of change

List all changes

Please list all changes in the commit.

Testing

zhoutong-fu commented 4 years ago

Two general comments: 1) in either the description or the readme file, update the latest model structure (you can use the new feature Xiaowei added last time that dumps the structure to a txt file); 2) can you check if run_detext_multitask.sh works on multitask test data (not the sparse wide features test data)?

StarWang commented 4 years ago

> Two general comments: 1) in either the description or the readme file, update the latest model structure (you can use the new feature Xiaowei added last time that dumps the structure to a txt file); 2) can you check if run_detext_multitask.sh works on multitask test data (not the sparse wide features test data)?

@zhoutong-fu Thanks for the review! I've updated the readme file and verified that run_detext_multitask.sh runs successfully.