goodformandspectacle / v_and_a

1 stars 0 forks source link

Split materials and techniques arrays into separate terms #6

Open george08 opened 9 years ago

george08 commented 9 years ago

Where you see lists like this...

techniques carving, gilding, colouring, glass working materials Clay, Glass, pigment, gold leaf, thitsi lacquer, ash, teak

... be nice if you could select each separate term.

infovore commented 9 years ago

I'd definitely recommend asking the V&A if they have a normalized data set - whilst I've done some simple normalisation for now, recreating tables they probably have, this one's quite big/slow and I wouldn't want to just split on a comma and get it wrong. (ie - this is data they have, and that's been lost in the transformation to One Big File. The API splits this correctly.)

infovore commented 9 years ago

(But yeah, ideally I wouldn't want those big comma separated strings to be counted as one 'material' which is what they currently are. The materials and techniques tables currently just serve to speed up performance - I'd be much happier if they were correctly modelled, too!)