openvax / topiary

Predict mutated T-cell epitopes from sequencing data
Apache License 2.0
27 stars 9 forks source link

VCF annotation #56

Open serge2016 opened 8 years ago

serge2016 commented 8 years ago

Hello! Is it possible to generate an annotated tab-separated file from VCF for future processing with pVAC-Seq? If I understand correctlym your first prototype (immuno) could do it.

iskandr commented 8 years ago

Hey @serge2016,

What fields do you need in the output? I haven't used pVAC-Seq so I'm not sure what it expects.

serge2016 commented 8 years ago

Dear @iskandr ,

All this is on their main page: https://github.com/griffithlab/pVAC-Seq

iskandr commented 8 years ago

Huh, it looks like pVAC-Seq expects only single amino acid substitutions. Maybe @jasonwalker80, @malachig, or @jhundal can shed light on how I should represent frameshifts and other multi-residue mutations as inputs to pVAC-Seq.

jhundal commented 8 years ago

Hi @iskandr and @serge2016 The current release of pVAC-Seq supports only single amino acid substitutions, but we do plan to add indel support as well as VCF file input in the upcoming releases. You could track the progress here: https://github.com/griffithlab/pVAC-Seq/issues/33

iskandr commented 8 years ago

@serge2016 I could add a flag for generating a pVACSeq TSV that ignores indels. Would that work for your purposes?

serge2016 commented 8 years ago

@iskandr It would be great! Thank you!

susannasiebert commented 8 years ago

@serge2016 @iskandr I'm not sure how much work you want to put into supporting the current pVAC-Seq input format since we are actively working on supporting VCFs. The current deadline for this is June 21st. That release would also support frameshifts and inframe indels.

iskandr commented 8 years ago

Hey @susannasiebert, what's your plan for translating frameshifts?

Also, @serge2016, do you still want me to implement this output format for "legacy" pVACSeq?

serge2016 commented 8 years ago

@iskandr, it seems to me that some kind of customizable output would be very useful. If it takes not very much time to implement this, I'd very thankful for such option.