Open ajtucker opened 4 years ago
Data split into 2, Registered deaths and date of death. Only Registered deaths are uploaded to PMD4 at the moment due to current limitations of the system. Needs to be changed back when PMD4 is capable of loading more than one dataset per pipeline
Running csv2rdf
on covid-19-death-occurrences-observations.csv
produces an empty file. I guess we need the next step GSS-Cogs/gss-utils#59
To get both datasets through to PMD had to change the Unit column values to Count when some should be Cumulative Count for the Date of Death dataset
Had some problems with duplicate rows. Made changes to both datasets but removed the Cumulative Count values altogether in the DATE OF DEATH dataset to get it published on PMD4 but these can be derived from the main data anyway so is not a major loss. Both datasets now published in PMD4 so moving this issue to the 'To Review' column
I've added a mapping for the data markers as we now have the "GSS Harmonised Symbols" codelist back.
However, running the update now shows an error with duplicate observations again.
Adding a label to pipelines that have the float64
temporary workaround.
Note that float64
isn't a valid XML Scheme datatype, it should be double
(64 bit float) or ideally decimal
(no limit on decimal places).
Stage 2 spec has been completed (for a second time). Waiting to find out exactly what the hyphens '-' mean but data can be transformed in the mean time but don't publish.
In the current data, the counts are showing up as decimals with a .0
on the end and in the RDF are all typed as strings, so we have to coerce them into xsd:decimal
s before we can add them up.
We should ensure that they're all integer counts.
Email from NISRA about '-' in data values
Hi David (Hull) Thank you for the reminder – your email got lost in a sea of emails and I haven’t had a chance to do my usual review of emails to check what’s outstanding! The ‘–‘ represents no covid cases and that is the same for all tables, so for example in Table 6 the ‘–‘ in Antrim & Newtownabbey on Week 38 just means that no cases were recorded. You are right though, that is not consistent with other tables where zeros are recorded! I hadn’t even noticed that! Claire (claire.rocks@nisra.gov.uk)
Both datasets checked. All looks fine except for know issues (error accessing landing page from PMD, etc.) and filters are not working.
BA Tech Spec Review 09/07/2020 Only tables 1-7 (deaths registered) have been included in the stage 1 spec, consider if tables 8-11 (death occurrences)should also be included.