Open JonesH opened 5 years ago
This would enable merging data like images etc
using generated IDs should solve this?
Only partly. IDs donÄt solve problem of data from separate sources referring to identical event. I'm working on a simple deduplication feature right now. But it's work in progress so nothing to push right now.
But we'll need the event based IDs anyway so let's add them as well :)
Yes, there's two kinds of deduplication:
Only the first one would be solved by generated IDs
Events should be deduplicated wrt events scraped from other pages