question on wide and deep

geyingli / unif

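A deep learning NLP framework built on TensorFlow and designed in the style of Scikit-Learn. It supports more than 40 model classes, covering language modeling, text classification, NER, MRC, knowledge distillation, and other areas.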

Apache License 2.0

114 stars, 27 forks

I think the core value of the wide and deep structure is the idea of structurally unifying discrete and continuous features, so I did not intend to follow every detail of the original work. Another reason is that outstanding ideas proposed after the wide and deep model came out, such as the attention mechanism and BERT, can further enhance the model's performance.

The answers are:

The input of the wide side can be any discrete feature: a text string, an integer, or even a float if you want.
The attention mechanism has proved to be a successful design. Used properly, it improves performance on NLP tasks (most of the time).
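
To illustrate the first point, here is a minimal sketch of how strings, integers, and floats can all be treated as discrete wide-side features by hashing them into a fixed number of buckets. This is only an assumption for illustration, not necessarily how unif handles its wide inputs; the helper names `wide_feature_id` and `encode_wide`, the feature names, and the bucket count are all hypothetical.

```python
import hashlib


def wide_feature_id(name, value, num_buckets=1000):
    """Map a (feature name, discrete value) pair to a bucket index.

    Hypothetical helper: the value is treated as a category whatever its
    Python type, so "Beijing", 27 and 3.5 are all handled the same way.
    """
    key = f"{name}={value!r}".encode("utf-8")
    digest = hashlib.md5(key).hexdigest()  # stable across runs, unlike hash()
    return int(digest, 16) % num_buckets


def encode_wide(sample, num_buckets=1000):
    """Turn a dict of discrete features into a sorted list of bucket ids."""
    return sorted({wide_feature_id(k, v, num_buckets) for k, v in sample.items()})


# Illustrative sample mixing a string, an integer and a float.
sample = {"city": "Beijing", "age": 27, "price_level": 3.5}
ids = encode_wide(sample)
```

The resulting ids could then index an embedding table or a sparse linear layer on the wide side. Hash collisions are possible with this approach, so the bucket count trades memory for accuracy.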

Hope this reply meets your needs :)

question on wide and deep #12