DigitalCommons / owned-by-oxford-project

Owned by Oxford
0 stars 0 forks source link

Evaluate Mirroring GFOs food project map #7

Closed ColmMassey closed 2 years ago

ColmMassey commented 3 years ago

Which fields for the projects on this map https://foodmap.goodfoodoxford.org/ can we model easily? Which don't make sense to model and what is left?

The source data is available on Nextcloud at ..CodeOperatives/Owned by Oxford/Food_GFO_21_10_15.xls

wu-lee commented 3 years ago

Fields in the spreadsheet are:

Of which I think the following mappings make currently sense:

There are some fields which look like vocabs/enums, which we could try to support, but that opens the can-of-worms which is vocabs.

ColmMassey commented 3 years ago

Organisation Id Do you mean Location Id?

homepage: Organisation Website facebook: Organisation Facebook twitter: Organisation Twitter

Are they missing off the first list?

ColmMassey commented 3 years ago

How much time required to mirror those fields into a triple store? To be added to OBO set.

wu-lee commented 3 years ago

Are they missing off the first list?

Oops, they were. I've updated the list. Organisation Id is presumably more correct here... unless the data is one row per location, in which case there may be duplicates. But then there is another problem, which is that the data has more than one location per organisation, which our current standard.csv schema doesn't really support. In which case we might need to fudge things and try to use Location Id.

wu-lee commented 3 years ago

...To be added to OBO set.

Yes, noticed that in issue #1, which is why I came here. One of the bigger jobs, I think -

Having looked at the data more closely, I see that yes, Location Id is unique, Organisation Id is not (Cherwell Larder, for example, has a ton of locations).

This may have implications in the directory panel, such as, should it show lots of duplicate names? And the behaviour will depend on the assumptions written into the code, which I'm not certain about. I think maybe Coops UK data has a similar structure, with some organisations appearing multiple times?

ColmMassey commented 3 years ago

Perhaps we should have a little design brainstorm about options for handling 'outlets'. I think Matt had a design to handle CO-ops UK but it hasn't been reviewed in years. It is no doubt something we will encounter again and again.

wu-lee commented 3 years ago

As I mentioned in #1, I've added an open-data conversion for good-food-oxford (still manual currently).

It's using ESSGLOBAL 2.1. As such I set the Country Id to GB and leave Territory Id blank... (I wonder if the latter field really deserves to be in the standard.csv schema, it's very ICA specific.)

I use the Location Id as the prirmary identifier, and merely capture (but ignore) the Organisation Id.

There's no usable description field, unfortunately.

As an aside: Cherwell Larder seems to have used a Google form as its homepage URL... so I think the data could do with some cleaning upstream.