This is the issue to create the generic Annex A pipeline, using the schema built in #61
The pipeline will need to:
[ ] validate an xlsx file with a tab per Annex A list
[ ] incorporate the new "year_month" element from the filename in exactly the same way "year" is currently handled; for the purpose of differentiating between different returns e.g. Annex A will allow 2024_Jan, 2024_Feb.... where 903 only has 2024; adding the column "year_month" instead of "year" to all files
[ ] produce a cleanfile output of 1 csv per list (so each input file will produce n csvs where n = number of lists in Annex A)
[ ] concatenate cleanfiles together at the la level
[ ] create a reports output for each list for the region
[ ] make the usual logs and outputs (clean, concat, reports) available in the standard places in the infrastructure
This is the issue to create the generic Annex A pipeline, using the schema built in #61 The pipeline will need to: