Open george08 opened 9 years ago
I'd definitely recommend asking the V&A if they have a normalized data set - whilst I've done some simple normalisation for now, recreating tables they probably have, this one's quite big/slow and I wouldn't want to just split on a comma and get it wrong. (ie - this is data they have, and that's been lost in the transformation to One Big File. The API splits this correctly.)
(But yeah, ideally I wouldn't want those big comma separated strings to be counted as one 'material' which is what they currently are. The materials
and techniques
tables currently just serve to speed up performance - I'd be much happier if they were correctly modelled, too!)
Where you see lists like this...
techniques carving, gilding, colouring, glass working materials Clay, Glass, pigment, gold leaf, thitsi lacquer, ash, teak
... be nice if you could select each separate term.