osome-iu / hoaxy-backend

Backend component for Hoaxy, a tool to visualize the spread of claims and fact checking
http://hoaxy.iuni.iu.edu/
GNU General Public License v3.0
139 stars 44 forks source link

Problems with duplicate and missing results #35

Closed filmenczer closed 5 years ago

filmenczer commented 5 years ago

Currently the front-end gets weird results from the back-end:

In the article table, the column group_id is meant to identify multiple copies of the same article (articles with the same title). Lucene should index only one article among those with the same group_id.

The Lucene search function has a duplicate filter to avoid having duplicate results.

One or both of the above must have broken in the new version.

chathuriw commented 5 years ago

We no longer see duplicates, missing articles