Use contigs BED file for genomicsdb

nvnieuwk commented 1 year ago

Description of feature

Use contigs BED file for genomicsdb instead of the normal one. This way genomicsdb can be run more efficiently with --merge-contigs-into-num-partitions

nvnieuwk commented 1 year ago

Also update the arguments to improve the process for bigger and smaller batches (e.g =>)

    withName: "^.*JOINT_GENOTYPING:GENOMICSDBIMPORT\$" {
        time = "3d"
        memory = 200.GB
        cpus = 32
        ext.args = {
                    [
            "--batch-size 80",
                        "--overwrite-existing-genomicsdb-workspace true",
                        "--genomicsdb-shared-posixfs-optimizations true",
                        "--merge-input-intervals true",
                        "--bypass-feature-reader true",
            "--max-num-intervals-to-import-in-parallel 30",
            "--reader-threads 30",
            "--bypass-feature-reader",
            "--consolidate true"
                    ].join(" ")
                }
    }

nvnieuwk commented 10 months ago

Added in #144

nf-cmgg / germline

Use contigs BED file for genomicsdb #145

Description of feature