nf-cmgg / germline

A nextflow pipeline for calling and annotating small germline variants from short DNA reads for WES and WGS data
https://nf-cmgg.github.io/germline/
MIT License
6 stars 1 forks source link

Use contigs BED file for genomicsdb #145

Closed nvnieuwk closed 10 months ago

nvnieuwk commented 1 year ago

Description of feature

Use contigs BED file for genomicsdb instead of the normal one. This way genomicsdb can be run more efficiently with --merge-contigs-into-num-partitions

nvnieuwk commented 1 year ago

Also update the arguments to improve the process for bigger and smaller batches (e.g =>)

    withName: "^.*JOINT_GENOTYPING:GENOMICSDBIMPORT\$" {
        time = "3d"
        memory = 200.GB
        cpus = 32
        ext.args = {
                    [
            "--batch-size 80",
                        "--overwrite-existing-genomicsdb-workspace true",
                        "--genomicsdb-shared-posixfs-optimizations true",
                        "--merge-input-intervals true",
                        "--bypass-feature-reader true",
            "--max-num-intervals-to-import-in-parallel 30",
            "--reader-threads 30",
            "--bypass-feature-reader",
            "--consolidate true"
                    ].join(" ")
                }
    }
nvnieuwk commented 10 months ago

Added in #144