va-big-data-genomics / trellis-mvp-functions

Trellis serverless data management framework for variant calling of VA MVP whole-genome sequencing data.
6 stars 1 forks source link

0.5.2: GATK job launched in duplicate #3

Closed pbilling closed 5 years ago

pbilling commented 5 years ago

Issue was that the initiating GATK query was triggered by any Ubam node. With 0.5.2, all relationships are added via triggers, and the OUTPUT relationship (i.e. (fastq-to-ubam)-[OUTPUT]->(Ubam)) returns the Ubam node and sends it to be checked for triggers. This means that the ubam is sent to the trigger function twice; 1) when the node is created and 2) when the OUTPUT relationship is added. Hence, job queries were being triggered twice.

Current fix is to add required header labels to the trigger, so that it is only activated after the relationship has been added, as opposed to after the node is created.