How to treat searches with independent FASTA files

NikolaMonteverde commented 4 weeks ago

- Describe the feature: In a recent experiment, I was searching 10 different sets of immunopeptidomics samples against 10 unique FASTAs. Each set of samples is from a tumor with known mutations which is why we use a custom FASTA for each. I want to compare the 10 samples side by side to see how broad changes in the immunopeptidome look not just the mutations. I think that if I simply fuse together the results tables, I will get a lot of missing values due to accumulation of false identifications in the 10 sample sets. Is there a way to correct for this? One alternative is avoiding the issue altogether by running all the samples together with a canonical FASTA for broad changes in immunopeptidome and then search with unique FASTAs for the mutations.

fcyu commented 4 weeks ago

Maybe combine those 10 databases and search your data together?

Best,

Fengchao

anesvi commented 3 weeks ago

I don't think there one clear way we can suggest. Depends on how noisy the databases are, what you plan to do with the IDs later. This is more of a research question.

Nesvilab / FragPipe

How to treat searches with independent FASTA files #1858