griffithlab / pVACtools

http://www.pvactools.org
BSD 3-Clause Clear License
129 stars 58 forks source link

Outputting accounted-for germline variants #1088

Open evelyn-schmidt opened 3 months ago

evelyn-schmidt commented 3 months ago

https://github.com/griffithlab/pVACtools/blob/7c4b383c554a62352c74a92884bc5214267810b1/pvactools/tools/pvacseq/generate_protein_fasta.py#L45C1-L48C6

The generate_protein_fasta.py script uses the phased-proximal-variants-vcf to create correct predicted sequences within the fasta file. It would be helpful if the list of variants that affect the final sequences were output to a file that could be easily viewed and eventually incorporated into the final neoantigen review files.

As of now, during the manual review process of the neoantigen candidates, reviewers have to check to make sure that germline variants are correctly accounted for in the sequences (and a majority of the time they are). For clarity that these germline variants are being reviewed, a list of the accounted-for germline variants would be helpful.

susannasiebert commented 3 months ago

This should be pretty straightforward. Such a file already gets created by the VCF conversion step so it just needs to be copied over from its temporary location.