This script is used to take multiple separate batch outputs, merge them all, and produce new batches from the larger cohort.
This overcomes a limitation in GATK-SV where batches > 500 create errors when collecting/calculating all the median coverage files (huge amount of data), and allows us to take the pre-calculated metrics from a larger number of samples when deciding new batches.
This is just a quick bump to let us define the output batch sizes with more flexibility
This script is used to take multiple separate batch outputs, merge them all, and produce new batches from the larger cohort.
This overcomes a limitation in GATK-SV where batches > 500 create errors when collecting/calculating all the median coverage files (huge amount of data), and allows us to take the pre-calculated metrics from a larger number of samples when deciding new batches.
This is just a quick bump to let us define the output batch sizes with more flexibility