Nesvilab / MSFragger

Ultrafast, comprehensive peptide identification for mass spectrometry–based proteomics
https://msfragger.nesvilab.org
103 stars 7 forks source link

Best strategy for obtaining full scored unfiltered list of peptides and decoys #133

Closed TomClarkMassSpec closed 3 years ago

TomClarkMassSpec commented 3 years ago

Hello, So I have python code as part of a project for downstream analysis. For it to work correctly, I need to obtain the list of scored peptides or PSMs including those that have been filtered out including decoys. I'd really like to leave the peptide spectrum matching and scoring to MSFragger / peptide prophet without my meddling. The output files *.tsv and such have outputs which are great but only include the filtered results. Before I continue, I'd like to ask the best strategy for obtaining the full lists of scored peptides (pepXML, mzID, or PIN??) or is this something I need to get my hands really dirty with? My project is about protein inferencing, not peptide ID which is clearly already done well by applications such as MS Fragger. Probably not the best question for Github but if you could send me in the best direction (for unfiltered PSMs including decoys), it would be most appreciated. Thanks Tom

fcyu commented 3 years ago

You can 1) check 'print decoys' and set all FDR threshold in FragPipe; or 2) let MSFragger output .tsv or pin file.

Best,

Fengchao

TomClarkMassSpec commented 3 years ago

Thanks Fengchao, I will try just this again. I just upgraded through several revisions of MSFragger/Fragpipe. It seemed to me setting the FDR thresholds up to an arbitrarily high number caused an issue but maybe this was propagated through to the peptide cut offs through implementation of protein prophet. When you say 'set all threshold', you mean I can should set to 0.9 or is there a command I may be unaware of?Tom

fcyu commented 3 years ago

Hi Tom,

You can change the FDR threshold in the ‘Filter' command: image

By setting 1 for PSM, peptide, ion, and protein level FDR thresholds; and 0 for peptide and protein probability thresholds, you should be able to get all PSMs.

Best,

Fengchao

anesvi commented 3 years ago

If you want no filtering at all, then also need to set --minprot 0 in PeptideProphet. Because by default it writes PSM with probability above 0.05

From: Fengchao notifications@github.com Sent: Tuesday, February 23, 2021 7:53 PM To: Nesvilab/MSFragger MSFragger@noreply.github.com Cc: Subscribed subscribed@noreply.github.com Subject: Re: [Nesvilab/MSFragger] Best strategy for obtaining full scored unfiltered list of peptides and decoys (#133)

External Email - Use Caution

Hi Tom,

You can change the FDR threshold in the ‘Filter' command: [image]https://user-images.githubusercontent.com/6926299/108928454-2f4c0e80-7610-11eb-9ac5-8b4588bdc46d.png

By setting 1 for PSM, peptide, ion, and protein level FDR thresholds; and 0 for peptide and protein probability thresholds, you should be able to get all PSMs.

Best,

Fengchao

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHubhttps://github.com/Nesvilab/MSFragger/issues/133#issuecomment-784647007, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AIIMM63N5BFQQPATDC7CBO3TAREVPANCNFSM4YDJ2GYA.


Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues

TomClarkMassSpec commented 3 years ago

Thank you Fengchao and Prof. Nesvizskii