What feature is being added or bug is being addressed?
Closes #144. This PR modifies autogvp and annotation filtering scripts to retain autopvs1 transcript annotation as final outputted annotation for each variant.
What was your approach?
modified 01-annotate_variants_CAVATICA_input.R and annotate_variants_custom_input.R to retain autopvs1 Feature column.
modified 04-filter_gene_annotations.R to filter annotated vcf for only vcf_id-Feature pairs in autogvp output.
What GitHub issue does your pull request address?
144
Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.
Which areas should receive a particularly close look?
Please run shell script on both pbta and custom test files
Is there anything that you want to discuss further?
There are rare instances (I believe only in the custom test files) in which variants are annotated as intergenic by VEP, but have transcript annotation by AutoPVS1. This results in NA annotation columns for these variants in the final output, since the AutoPVS1 transcript is not found in the VEP vcf file. I will plan to run this on larger data sets to determine if this only happens with intergenic variants, in which case we can annotate them as such in the final output.
Documentation Checklist
[X] The function has examples to showcase the usage
hmm good point...I suppose we could add instructions for modifying output_colnames, it's pretty comprehensive now but there may be other annotations we're not capturing here
Purpose/implementation Section
What feature is being added or bug is being addressed?
Closes #144. This PR modifies autogvp and annotation filtering scripts to retain autopvs1 transcript annotation as final outputted annotation for each variant.
What was your approach?
01-annotate_variants_CAVATICA_input.R
andannotate_variants_custom_input.R
to retain autopvs1Feature
column.04-filter_gene_annotations.R
to filter annotated vcf for onlyvcf_id
-Feature
pairs in autogvp output.What GitHub issue does your pull request address?
144
Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.
Which areas should receive a particularly close look?
Please run shell script on both pbta and custom test files
Is there anything that you want to discuss further?
There are rare instances (I believe only in the custom test files) in which variants are annotated as intergenic by VEP, but have transcript annotation by AutoPVS1. This results in
NA
annotation columns for these variants in the final output, since the AutoPVS1 transcript is not found in the VEP vcf file. I will plan to run this on larger data sets to determine if this only happens with intergenic variants, in which case we can annotate them as such in the final output.Documentation Checklist