Sotera / GEQE

Geo Event Query by Example - Leverage geo-located temporal text data to identify similar locations or events.
http://sotera.github.io/GEQE
The Unlicense

Duplicate entries of individual tweets #36

Open drJAGartner opened 9 years ago

drJAGartner commented 9 years ago

There are several points where, if you click through the first few tweets, they have identical users and texts. We need to check each post against the others we are returning and remove duplicates.
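
As a rough illustration of that check, here is a minimal Python sketch that drops repeated posts from a result list while keeping the original order. The field names `user` and `text` and the helper `drop_duplicate_posts` are assumptions for illustration, not GEQE's actual schema or code.

```python
from typing import Iterable, Iterator, Mapping


def drop_duplicate_posts(posts: Iterable[Mapping[str, str]]) -> Iterator[Mapping[str, str]]:
    """Yield posts in order, skipping any whose (user, text) pair was already seen."""
    seen = set()
    for post in posts:
        # "user" and "text" are assumed field names, not GEQE's real schema.
        key = (post.get("user"), post.get("text"))
        if key in seen:
            continue
        seen.add(key)
        yield post


# Hypothetical query results: the first two entries are the same tweet.
results = [
    {"user": "alice", "text": "Game day!"},
    {"user": "alice", "text": "Game day!"},
    {"user": "bob", "text": "Game day!"},
]
unique = list(drop_duplicate_posts(results))
```

Two different users posting the same text are kept as separate posts here; only an exact match on both fields counts as a duplicate.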

eric-kimbrel commented 9 years ago

This one sounds more like a data issue. We should make sure the data in Elasticsearch is correct rather than trying to verify query results every time.

drJAGartner commented 9 years ago

I don't disagree; is this something that goes into the upload process, or something run atop the database as is?
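
For the upload-side option, one possible approach (an assumption, not something already in GEQE) is to derive each document's ID from the fields that define a duplicate. Elasticsearch replaces a document when another one is indexed under the same ID, so uploading the same tweet twice would overwrite it rather than add a copy. A minimal Python sketch, where the plain dict only stands in for the index and `tweet_doc_id`, `index_by_id`, `user`, and `text` are hypothetical names:

```python
import hashlib


def tweet_doc_id(user: str, text: str) -> str:
    """Build a deterministic ID from the fields that define a duplicate."""
    # The unit separator keeps ("ab", "c") and ("a", "bc") from colliding.
    payload = f"{user}\x1f{text}".encode("utf-8")
    return hashlib.sha1(payload).hexdigest()


def index_by_id(tweets):
    """Simulate ID-keyed indexing: a later duplicate overwrites the earlier copy."""
    store = {}
    for tweet in tweets:
        # "user" and "text" are assumed field names, not GEQE's real schema.
        store[tweet_doc_id(tweet["user"], tweet["text"])] = tweet
    return store


# Hypothetical upload batch containing one repeated tweet.
batch = [
    {"user": "alice", "text": "Game day!"},
    {"user": "alice", "text": "Game day!"},
    {"user": "bob", "text": "Game day!"},
]
indexed = index_by_id(batch)
```

The same ID function could also be run over the existing database to find documents that share an ID, so it does not force a choice between the two options.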

On Mon, Aug 31, 2015 at 3:00 PM, Eric Kimbrel notifications@github.com wrote:

This one sounds more like a data issue. We should make sure the data in Elasticsearch is correct rather than trying to verify query results every time.
