To merge the data, we first take the entries that are duplicated across the two tables (those linked by a sameas predicate) and merge each pair. We then add the entries from one table that have no sameas predicate, followed by the entries from the other table that have no sameas predicate.
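The three passes described above can be sketched roughly as follows. This is only an illustration, not the project's actual code: it assumes each table is a dict mapping a URI to a record dict, that a linked record carries a hypothetical "sameas" key holding the URI of its counterpart in the other table, and that links are recorded on both sides so that following them from the first table is enough. The function name merge_tables is also made up for this example.

```python
def merge_tables(table_a, table_b):
    """Merge two tables in three passes (illustrative sketch only).

    Assumption: each table maps URI -> record dict, and a linked record
    has a "sameas" key holding the URI of its twin in the other table.
    """
    merged = []

    # Pass 1: entries present in both tables (linked by sameas) are combined.
    for rec_a in table_a.values():
        rec_b = table_b.get(rec_a.get("sameas"))
        if rec_b is not None:
            combined = {**rec_b, **rec_a}  # values from table_a win on conflict
            combined.pop("sameas", None)   # the link itself is not part of the output
            merged.append(combined)

    # Pass 2: entries from the first table that have no sameas predicate.
    merged.extend(dict(rec) for rec in table_a.values() if "sameas" not in rec)

    # Pass 3: entries from the second table that have no sameas predicate.
    merged.extend(dict(rec) for rec in table_b.values() if "sameas" not in rec)

    return merged
```

Note that, as in the description, an entry whose sameas link points at something missing from the other table is not picked up by any pass in this sketch.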
The merge is currently hard-coded and not generic: only the coops uk and dotcoop datasets can be merged. The choice of fields is hard-coded as well. We currently take the following fields from each table:
coopsuk:
- name
- description
- URI (unique identifier for the entry)
- registered organisation number

dotcoop:
- homepage
- address
- lat
- long

merged results:
- name
- description
- URI (unique identifier for the entry)
- registered organisation number
- homepage
- address
- lat
- long
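As a minimal sketch of how the hard-coded field selection might look for a single linked pair, the example below builds one merged record from a coopsuk record and a dotcoop record. The key names (for instance "registered_number" and "uri") and the function merge_pair are assumptions made for illustration; the real column names in the datasets may differ.

```python
# Hypothetical key names standing in for the fields listed above.
COOPSUK_FIELDS = ("name", "description", "uri", "registered_number")
DOTCOOP_FIELDS = ("homepage", "address", "lat", "long")


def merge_pair(coopsuk_rec, dotcoop_rec):
    """Build one merged record from a linked coopsuk/dotcoop pair."""
    # Identity and descriptive fields come from the coopsuk record.
    merged = {field: coopsuk_rec.get(field) for field in COOPSUK_FIELDS}
    # Web and location fields come from the dotcoop record.
    merged.update({field: dotcoop_rec.get(field) for field in DOTCOOP_FIELDS})
    return merged
```

Keeping the two field lists as constants makes the hard-coding visible in one place, which would be the natural starting point if the merge is later made generic.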
We are now publishing data from ICA-Youth. Some of those co-ops have .coop domains and are therefore linked to, or are the same as (sameas), a co-op in the DotCoop dataset.
https://github.com/SolidarityEconomyAssociation/open-data-and-maps/issues/178