Design work on how sameas merging works

sunnydean commented 4 years ago

To merge the data we first take the duplicated ones between each table and merge them. Then we take the entries that do not contain a sameas predicate from one table. Then we take the entries that do not contain a sameas predicate from the other table.

We are hard-coding the merge currently and have not made it generic (i.e. you can merge only coops uk and dotcoop datasets). We also have hard-coded the way the merging works, we are currently taking the following fields from the tables:

coopsuk

name
description
URI (unique identifier for the entry)
registered organisation number

dotcoop

homepage
address
lat
long

merged results

name
description
URI (unique identifier for the entry)
registered organisation number
homepage
address
lat
long

https://github.com/SolidarityEconomyAssociation/open-data-and-maps/issues/178

sunnydean commented 4 years ago

ColmMassey commented 4 years ago

We are now publishing data from ICA-Youth. Some of those co-ops have .coop domains and thus are connected to or are the sameas a co-op in the DotCoop data set.

DigitalCommons / open-data-and-maps

Design work on how sameas merging works #184