Similar to our other pipelines, we need a repeating pipeline (i.e. cronjob) that queries the UCAM API for all known records, and transforms the result to a request to VTT to export their data. Given our current infrastructure, this would look like another `DAG and use similar steps to achieve this.
Additional considerations:
requests for exports are capped at 'today' - meaning that future results should check if part of a patients 'wear period' is yet to be requested for export. I.e. - additional logic is required to check if the exports are inclusive of the whole data period
the result of this pipeline needs to be in .csv format and potentially be emailed directly (cc' WP3 devs) to VTT to request the export
given a variable timespan for the actual export to appear in the S3 bucket, we will have to run the VTT pipeline more frequent than weekly. I'd suggest 3x a week.
Similar to our other pipelines, we need a repeating pipeline (i.e. cronjob) that queries the UCAM API for all known records, and transforms the result to a request to VTT to export their data. Given our current infrastructure, this would look like another
`DAG
and use similar steps to achieve this.Additional considerations:
today
' - meaning that future results should check if part of a patients 'wear period' is yet to be requested for export. I.e. - additional logic is required to check if the exports are inclusive of the whole data period.csv
format and potentially be emailed directly (cc' WP3 devs) to VTT to request the export