peggynewman closed this 3 years ago
NT Fauna corrected and dr361 loaded with the correct records. Checking sensitivity after reload and re-index.
There appear to be duplicate keys in dr362. Checking to see if there is an additional term in the key terms.
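A minimal sketch of that kind of check, assuming the records are available as CSV rows. The column names (`catalogNumber`, `institutionCode`) and the sample data are hypothetical, chosen only to illustrate how adding a term to the key can resolve apparent duplicates; this is not the actual check run against dr362.

```python
from collections import Counter
import csv
import io


def duplicate_keys(rows, key_terms):
    """Return {key tuple: count} for every key that occurs more than once."""
    counts = Counter(tuple(row.get(term, "") for term in key_terms) for row in rows)
    return {key: n for key, n in counts.items() if n > 1}


# Hypothetical sample data, not taken from dr362.
sample = io.StringIO(
    "catalogNumber,institutionCode\n"
    "A1,NT\n"
    "A1,DNA\n"
    "A2,NT\n"
)
rows = list(csv.DictReader(sample))

# With a single key term, A1 appears twice.
dups_one = duplicate_keys(rows, ["catalogNumber"])

# Adding a second (assumed) key term makes every key unique.
dups_two = duplicate_keys(rows, ["catalogNumber", "institutionCode"])
```

If the duplicates disappear once a candidate term is added, that term is probably missing from the configured key terms.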
dr15866 comes from a Royal Botanic Garden IPT server. In this dataset, stateProvince usually refers to a region: 'The Northern Territory' matches 225394 records, 'Central Australia North' gives 9055, and 'Darwin and Gulf' gives 2114, for a total of 236563 out of 325548 records. dr15866 shows 325405 records, 260799 of them in the NT, which is roughly in line with the numbers quoted above. The IPT doesn't provide the original of the record.
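The arithmetic behind those totals can be checked with a short sketch. The counts are the ones quoted above; the variable names are just for illustration.

```python
# Record counts per stateProvince value, as quoted in the comment above.
state_province_counts = {
    "The Northern Territory": 225394,
    "Central Australia North": 9055,
    "Darwin and Gulf": 2114,
}
archive_total = 325548  # records in the uploaded archive

# Sum the three NT-related values and work out their share of the archive.
nt_total = sum(state_province_counts.values())
nt_share = nt_total / archive_total

print(f"{nt_total} of {archive_total} records ({nt_share:.1%})")
```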
See also #400
Oh dear, this is at least the second time I've (noticed that I've) opened a second GH issue for the same thing. Sorry everyone.
@nielsklazenga check out the text from the email to review the numbers for DNA AVH.
@peggynewman, the DNA AVH Darwin Core Archive that was uploaded last Friday contained 325,548 records, so the numbers for the DNA herbarium in ALA look pretty good.
Where do you think their "276,981" comes from? @nielsklazenga
I think a lot of the NT (that's the herbarium in Alice Springs) records might be in ALA (and in the cache I keep) twice. They were initially delivered as part of the DNA collection, but split off later. When they start delivering their own records, that will sort itself out.
@charvolant tomorrow I will catch up with Peggy to summarise the collection community findings and actions at our end so we can get back to them.
If @peggynewman is not across the latest status for this would you mind letting us know where it is?
I think the last thing remaining was to analyse/find an explanation for the sensitive records.
Thanks
Sent thank you email and summary today.
dr361 Fauna Atlas NT
dr362 Flora Atlas NT
dr15866 Flora Atlas NT
Feedback below. Action is to reload the CSVs from the upload server into pipelines. They need to be converted to DwC-A (Darwin Core Archive).