thunlp / EntityDuetNeuralRanking

Entity-Duet Neural Ranking Model
MIT License
153 stars 20 forks source link

Separate matching on title and on Snippet #9

Closed pommedeterresautee closed 4 years ago

pommedeterresautee commented 6 years ago

I have separated the matching Query Vs Title and Query Vs Snippet It has increased the inference time on 10 cores CPU from 50ms to 67ms (still manageable). MAP (model with CNN) improved from 0.31 to 0.36 (with raw click on a search engine using BM25). For what it worths, model without CNN (not separating title and snippet text) had a 0.27 MAP... meaning in my case, separating matching on title and snippet text improves perf in a similar way than adding CNN.

You may want to test this approach on SOGOU / Bing.

EdwardZH commented 5 years ago

I think more information will help to improve model performance. We will test on a public datatset, SogouQCL.

EdwardZH commented 4 years ago

Hi, for more neural IR training, data augmentation and more about EDRM. Please refer to our WWW2020 Paper Selective Weak Supervision for Neural Information Retrieval. Thank you for your attention. I will close this issue.