MSGFPlus / msgfplus

MS-GF+ (aka MSGF+ or MSGFPlus) performs peptide identification by scoring MS/MS spectra against peptides derived from a protein sequence database.
Other
77 stars 36 forks source link

Question about my parameter file #151

Open JannikSchneider12 opened 10 months ago

JannikSchneider12 commented 10 months ago

Hello everyone,

I want to use a parameter file to execute a search that was previously done in Sequest. So I want to "translate" the parameters to MSGF+. However nearly every q-value in the results is around 1.0, so I think I have some issues in my parameters. I would be glad, if someone could have a look.

So these are the original Sequest params: Screenshot 2024-01-31 162730

And this is my current MSGF+ parameter file: Screenshot 2024-02-01 062002 Screenshot 2024-02-01 062036 Screenshot 2024-02-01 062053

Thanks for your time and help

FarmGeek4Life commented 10 months ago

I'm not familiar with using Sequest; does the Sequest search automatically include the TMT6Plex static modifications?

alchemistmatt commented 10 months ago

The TMT6Plex static mod is definitely the issue, since your SEQUEST settings did not have TMT and that is a static mod. We have several example parameter files available at https://github.com/MSGFPlus/msgfplus/tree/master/docs/ParameterFiles

I started with file MSGFPlus_Tryp_Dyn_MetOx_STYPhos_Stat_CysAlk_20ppmParTol.txt and updated it to match your settings, specifying a ±10 ppm search tolerance and these static and dynamic mods:

StaticMod=C2H3N1O1,             C,   fix, any,         Carbamidomethyl    # Fixed Carbamidomethyl C (alkylation, +57.0215)

DynamicMod=O1,                  M,   opt, any,         Oxidation          # Oxidized methionine (+15.9949)
DynamicMod=HO3P,                STY, opt, any,         Phospho            # Phosphorylation STY
DynamicMod=C2H2O,               *,   opt, Prot-N-term, Acetyl             # Acetylation, Protein N-term
DynamicMod=H-9 C-5 N-1 O-1 S-1, *,   opt, Prot-N-term, Met-loss           # Methionine removal from Protein N-term (-131.040485)
DynamicMod=H-7 C-3 N-1 S-1,     *,   opt, Prot-N-term, Met-loss-Acet      # Methionine removal from Protein N-term, then acetylation (-89.02992)

Note that while methionine removal from the protein N-terminus is a common dynamic mod that many search engines check for, I'm not familiar with removing methionine then adding acetylation (and UniMod does not have a corresponding mod for a loss of 89.0299 Da). Nevertheless, it should work. I added spaces in the empirical formulas for the Met-loss mods for readability.

Here's the link to the customized parameter file: MSGFPlus_Tryp_Dyn_MetOx_STYPhos_ProtNTermAcet_ProtNTermMetLoss_Stat_CysAlk_10ppmParTol.txt

JannikSchneider12 commented 10 months ago

Hey,

Thanks for your help! Yes I noticed the static modification after I posted this too and it was really dumb that I didnt see this earlier. This was the main reason and now I am getting more significant identifications than in the original search, as expected

JannikSchneider12 commented 10 months ago

Regarding the Met-loss+Acetyl, I found this in unimod, so maybe my mass specification was not precise enough? Screenshot 2024-02-02 102754

alchemistmatt commented 10 months ago

I completely missed Met-loss+Acetyl , yes, that's the interim name for the mod.
Thus, change the mod name from Met-loss-Acet to Met-loss+Acetyl , giving:

DynamicMod=H-7 C-3 N-1 S-1,     *,   opt, Prot-N-term, Met-loss+Acetyl      # Methionine removal from Protein N-term, then acetylation (-89.02992)