Closed ColmMassey closed 2 years ago
Fields in the spreadsheet are:
Of which I think the following mappings make currently sense:
There are some fields which look like vocabs/enums, which we could try to support, but that opens the can-of-worms which is vocabs.
Organisation Id Do you mean Location Id?
homepage: Organisation Website facebook: Organisation Facebook twitter: Organisation Twitter
Are they missing off the first list?
How much time required to mirror those fields into a triple store? To be added to OBO set.
Are they missing off the first list?
Oops, they were. I've updated the list. Organisation Id is presumably more correct here... unless the data is one row per location, in which case there may be duplicates. But then there is another problem, which is that the data has more than one location per organisation, which our current standard.csv
schema doesn't really support. In which case we might need to fudge things and try to use Location Id.
...To be added to OBO set.
Yes, noticed that in issue #1, which is why I came here. One of the bigger jobs, I think -
Having looked at the data more closely, I see that yes, Location Id is unique, Organisation Id is not (Cherwell Larder, for example, has a ton of locations).
This may have implications in the directory panel, such as, should it show lots of duplicate names? And the behaviour will depend on the assumptions written into the code, which I'm not certain about. I think maybe Coops UK data has a similar structure, with some organisations appearing multiple times?
Perhaps we should have a little design brainstorm about options for handling 'outlets'. I think Matt had a design to handle CO-ops UK but it hasn't been reviewed in years. It is no doubt something we will encounter again and again.
As I mentioned in #1, I've added an open-data conversion for good-food-oxford (still manual currently).
It's using ESSGLOBAL 2.1. As such I set the Country Id to GB
and leave Territory Id blank... (I wonder if the latter field really deserves to be in the standard.csv schema, it's very ICA specific.)
I use the Location Id as the prirmary identifier, and merely capture (but ignore) the Organisation Id.
There's no usable description field, unfortunately.
As an aside: Cherwell Larder seems to have used a Google form as its homepage URL... so I think the data could do with some cleaning upstream.
Which fields for the projects on this map https://foodmap.goodfoodoxford.org/ can we model easily? Which don't make sense to model and what is left?
The source data is available on Nextcloud at ..CodeOperatives/Owned by Oxford/Food_GFO_21_10_15.xls