excieve / dragnet

Catching the big fish
MIT License
2 stars 1 forks source link

Vehicles of type "other" are marked as duplicates, while are actually different #51

Open pro100olga opened 4 years ago

pro100olga commented 4 years ago

For this declaration 3 separate vehicles were aggregated as 2

Initial declaration:

96493135_2300822030223582_7973750528989462528_n

Aggregation:

96405253_890413061404074_5693113158122602496_n

This is because we check objects (vehicles in this case) for duplication using combination of object type, brand, model and production year.

In this particular case 2 different vehicles (квадроцикл \ снігохід) happened to have the same brand, model and production year, while object was assigned to "Other" (and the difference can be spotted because the description of other type is different)

Possible solution: add 'otherObjectType' field to duplicates detection.