jsotobroad closed 8 years ago
I confirmed with EB that the file used here does not need to be protected, so that should make things easier.
Whoever takes this ticket: if you find it still takes too long to run, let me know and I think we can work together to slim down the use case even further (but it may be just fine the way it is).
When running the following example WDL task:
where SplitGvcfTouch is called like:
indexing_list is an array of integers 0-94, sample_name can be any string, and interval_list is the attached wgs_split_10000000_tiledb.intervalist.txt.
With these inputs, each scattered task should be globbing an array of 901 elements for both gvcf_list and gvcf_index_list.
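To give a rough sense of the scale these inputs imply, here is a back-of-the-envelope count of how many file references the implicit gather has to collect. It is only arithmetic on the numbers stated above (95 shards, 901 globbed files per output, two globbed outputs). The function and constant names are made up for illustration, and it makes no claim about how Cromwell processes these files internally.

```python
# Counts taken from the description above; names are illustrative only.
NUM_SHARDS = len(range(0, 95))            # indexing_list covers integers 0-94
FILES_PER_GLOB = 901                      # elements globbed per output, per shard
GLOBBED_OUTPUTS = ("gvcf_list", "gvcf_index_list")


def total_gathered_files(num_shards: int, files_per_glob: int, num_outputs: int) -> int:
    """Number of file references collected once every shard has finished."""
    return num_shards * files_per_glob * num_outputs


per_output = total_gathered_files(NUM_SHARDS, FILES_PER_GLOB, 1)
total = total_gathered_files(NUM_SHARDS, FILES_PER_GLOB, len(GLOBBED_OUTPUTS))

print(f"shards: {NUM_SHARDS}")
print(f"files per gathered output: {per_output}")
print(f"files across both outputs: {total}")
```

So each gathered output is a 95 x 901 nested array, and the two outputs together reference roughly 171,000 files.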
When this is run on the JES backend, the timing diagram shows 25-30 min of "cromwell final overhead", which is much longer than ever previously seen. Once all of the scatter tasks are completed, the implicit gatherer starts but never finishes (at least I haven't seen it finish yet). This task also causes issues when trying to call cache previous results.