bcgov / BCHeritage_arches

BC Heritage Branch Arches configuration, schemas and extensions.
Apache License 2.0

Resolve truncated column data issue #24

Closed LeannePyle closed 2 years ago

LeannePyle commented 2 years ago
bferguso commented 2 years ago

OK, so I've had a look at the data and there is no decent unique column combination we can use to connect the separate spreadsheets. So far I'm thinking the best approach is to look for long values (> 249 characters) in the latest spreadsheet and work our way backwards through the earlier spreadsheets to try to fill in the blanks. I think it's potentially going to be a manual process, but I'll try to match the values up as best I can.

So far I'm seeing potential truncation as follows:

Based on this, most columns seem like they can be mapped except significant_fossil_list. We may need to start with the long (133) list and do some manual corrections.
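
For illustration only, the approach described above could be sketched roughly as follows. This is a minimal sketch, not the script actually used: it assumes each spreadsheet has been loaded as a list of dicts (e.g. via `csv.DictReader`), that a cell longer than 249 characters is a truncation suspect, and that the untruncated text in an earlier spreadsheet starts with the same characters. The function names `find_suspect_cells` and `find_fuller_value` are hypothetical.

```python
from typing import Dict, Iterable, List, Optional, Tuple

# Threshold taken from the comment above; cells longer than this are suspects.
SUSPECT_LENGTH = 249

Row = Dict[str, str]


def find_suspect_cells(rows: List[Row],
                       min_length: int = SUSPECT_LENGTH) -> List[Tuple[int, str]]:
    """Return (row_index, column_name) for every cell longer than min_length."""
    suspects = []
    for index, row in enumerate(rows):
        for column, value in row.items():
            if value is not None and len(value) > min_length:
                suspects.append((index, column))
    return suspects


def find_fuller_value(truncated: str,
                      candidates: Iterable[str]) -> Optional[str]:
    """Return the single longer candidate that starts with `truncated`.

    Returns None when there is no such candidate or when more than one
    distinct candidate matches, since those cases need manual review.
    """
    matches = {
        candidate
        for candidate in candidates
        if candidate
        and len(candidate) > len(truncated)
        and candidate.startswith(truncated)
    }
    return matches.pop() if len(matches) == 1 else None
```

Because there is no reliable key to join the spreadsheets, the sketch matches on the text prefix alone and deliberately returns nothing when a match is ambiguous, leaving those rows for manual correction.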

bferguso commented 2 years ago

Spreadsheet created and sent to @LeannePyle & @elisabethdeom via SOFT. Will be available for 7 days.

bferguso commented 2 years ago

Issue moved to bcgov/BCHeritage #136 via ZenHub