yongzhuo / nlg-yongzhuo

A Chinese text generation (NLG) toolkit for text summarization, with corpus data. Extractive summarization methods: Lead3, keyword, TextRank, TextTeaser, word significance, LDA, LSI, NMF. (graph, feature, topic model, summarization tool or toolkit)
https://blog.csdn.net/rensihui
MIT License
405 stars 53 forks

Question about feature_base/text_teaser.py #5

Closed 1397981186 closed 4 years ago

1397981186 commented 4 years ago

Why is the title feature computed as (len_mix_word + 1.0) / (len_mix_word + 2.0) / len_title_word? According to the original paper it should be len_mix_word / len_title_word.

yongzhuo commented 4 years ago

It raises the weight of this feature when there are no overlapping words.
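
To make the difference between the two formulas concrete, here is a minimal sketch that evaluates both side by side. It is not the code from feature_base/text_teaser.py: the function names `title_score_paper` and `title_score_smoothed` are hypothetical, only the variable names `len_mix_word` and `len_title_word` come from the thread, and the sketch assumes `len_title_word` is greater than zero.

```python
def title_score_paper(len_mix_word, len_title_word):
    """Title feature as quoted from the paper: overlap count / title length."""
    return len_mix_word / len_title_word


def title_score_smoothed(len_mix_word, len_title_word):
    """Title feature as quoted from the repository code in this issue.

    The (n + 1) / (n + 2) factor is 0.5 when n == 0, so the score
    stays above zero even without any overlapping words.
    """
    return (len_mix_word + 1.0) / (len_mix_word + 2.0) / len_title_word


# Compare both variants for a hypothetical title of 4 words.
len_title_word = 4
for len_mix_word in range(len_title_word + 1):
    paper = title_score_paper(len_mix_word, len_title_word)
    smoothed = title_score_smoothed(len_mix_word, len_title_word)
    print(f"overlap={len_mix_word}: paper={paper:.4f}, smoothed={smoothed:.4f}")
```

With zero overlap the paper's formula gives exactly 0, while the smoothed variant gives 0.5 / len_title_word, which matches the explanation above. Note that once there is at least one overlapping word, the smoothed score is always lower than the paper's, because (n + 1) / (n + 2) is less than 1 while n is at least 1; the smoothed variant also grows much more slowly as the overlap increases.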