openvax / topiary

Predict mutated T-cell epitopes from sequencing data
Apache License 2.0
27 stars 9 forks source link

Replace "self-ligandome" filtering with just search over the full proteome #49

Open iskandr opened 8 years ago

iskandr commented 8 years ago

I'm not sure it's worth the extra complexity to filter candidate epitopes by a "self-ligandome" set of peptides. No one other than Kipp ever generated these peptide sets and no one seems to use them. So, in the interest of using a more precise filter, Topiary ended up without any kind of self-filtering whatsoever. Simpler alternative: chop up the reference proteome from pyensembl and check for membership in that peptide set.

timodonnell commented 8 years ago

+1

tavinathanson commented 8 years ago

Aside from not needing to run MHC binding prediction over the proteome, remind me how that differs in output? Is it the same unless we were to do approximate matching with the proteome, in which case we'd want to run MHC binding prediction for additional peptides (possibly via the self-ligandome data)?

iskandr commented 8 years ago

I think all we'd be giving up is the ability to later use inexact string matches (such as looking at only interior TCR-facing residues).

tavinathanson commented 8 years ago

Given that we aren't doing that currently, 👍

JPFinnigan commented 8 years ago

Sacha Gnjatic's group is incorporating minimal Levenshtein distance between a predicted neoepitope and all members of the self-ligandome as a metric for peptide ranking. I agree that filtering against the self-proteome is a good (perhaps best) default behavior but I wouldn't dismantle the self-ligandome components just yet unless they are impeding progress.