scholl-lab / vcf-filtering

A collection of scripts for filtering annotated variant call format files
MIT License
2 stars 1 forks source link

Feature Request: Allow Filtering for All Genes #51

Closed berntpopp closed 1 month ago

berntpopp commented 1 month ago

Description

The current script filter_variants.sh needs to be extended to allow filtering for all genes. The easiest solution is to run snpEff -Xmx8g genes2bed $reference with an empty gene name. We should implement a new option to allow gene_name to be set to "all" and then handle it by setting $gene_name to an empty value. All relevant checks and processes should be adapted accordingly.

Proposed Changes

  1. Allow "all" as a gene_name value:

    • Modify the script to recognize "all" as a special value for the gene_name parameter.
    • When "all" is specified, set $gene_name to an empty string.
  2. Adapt genes2bed command:

    • Ensure the genes2bed command is executed with an empty gene name if "all" is specified, which results in creating a BED file for all genes.
  3. Update checks and processes:

    • Modify all checks and processes to handle the scenario where gene_name is "all".
    • Ensure the script can handle the filtering and processing of variants for all genes without any issues.

Tasks

Benefits

Additional Notes