griffithlab / pVACtools

http://www.pvactools.org
BSD 3-Clause Clear License
143 stars 59 forks source link

Add ability to run the reference proteome step on the full peptide sequence in a fasta #787

Open susannasiebert opened 2 years ago

malachig commented 1 year ago

In general there is quite a lot of interest in having better integration of the long-peptide forms. Everything in our epitopes results files and pVACview is focused on the predicted binding peptide. And this makes sense. But it would also be really helpful to consider factors in the context of the longer peptide that harbors the core peptide such as:

susannasiebert commented 1 year ago

Current pVACseq runs already saves a peptide fasta (albeit only with a flanking length of max(epitope_lengths) - 1), which we use for a lot of these steps. I think this would require us to implement another parameter "vaccine candidate length" or similar to create a fasta with even longer lengths.

This could then be easily used for running:

For cleavage site predictions we need an additional 10 aa flanking length on top of the desired vaccine peptide length (see #615)

I almost envision a separate report for this that wold enumerate vaccine peptides and the issues identified above. This report could also include the Best Peptide for class I and class II. I'd have to think a bit more about how we could report the alternate splice sites.