smith-chem-wisc / FlashLFQ

Ultra-fast label-free quantification algorithm for mass-spectrometry proteomics
GNU Lesser General Public License v3.0

format of modifications on the sequence #54

Closed Dmorgen closed 5 years ago

Dmorgen commented 6 years ago

Hi,

How do I add modifications (specifically various glycans) to the peptide sequence? By mass? Is there a specific format (square brackets, for example)?

Thanks! David.

trishorts commented 6 years ago

In the G-PTM-D task, some modifications are searched for automatically. You can click on the triangle to expand the list and see which ones are included.

trishorts commented 6 years ago

That's in MetaMorpheus. FlashLFQ should be able to read the output and integrate accordingly. If you have mass spec data from another vendor, then I guess you have to follow the modification format that MetaMorpheus writes.

Dmorgen commented 6 years ago

OK, thanks! I'll have a look at how MM writes glycans.

rmillikin commented 6 years ago

FlashLFQ does not use modifications for anything other than shifting the mass it looks for. As you probably realize, different programs write modifications differently, and there are simply too many for me to parse into chemical formulas. FlashLFQ calculates the isotopic distribution from the base amino acid sequence only, and uses the monoisotopic mass provided in the ID file. Large glycans may throw this system off, because they can change the isotopic distribution dramatically; I have not looked into that problem extensively.

So it does not matter what format you write the modification in, only that it stays the same for every identification of the same modified peptide.
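
To illustrate the point about consistent formatting, here is a rough Python sketch. It is not FlashLFQ's code. The function names, the tuple layout, and the example notations are all hypothetical, and the masses are approximate illustrative values. It assumes only what is described above: the modified sequence acts as an opaque label, and the monoisotopic mass from the ID file determines the m/z to look for.

```python
from collections import defaultdict

PROTON_MASS = 1.007276466  # Da, mass of a proton


def mz_from_mono_mass(mono_mass, charge):
    """Convert a neutral monoisotopic mass to the m/z of the [M + zH]z+ ion."""
    return (mono_mass + charge * PROTON_MASS) / charge


def group_by_full_sequence(identifications):
    """Group identifications by their modified-sequence string.

    The string is compared verbatim (hypothetical behavior for this sketch),
    so two notations for the same modification form two separate groups.
    """
    groups = defaultdict(list)
    for full_sequence, mono_mass in identifications:
        groups[full_sequence].append(mono_mass)
    return dict(groups)


# Illustrative identifications: (modified sequence, monoisotopic mass in Da).
# The same glycopeptide is written in two different notations.
ids = [
    ("PEPT[HexNAc]IDE", 1002.4393),
    ("PEPT[HexNAc]IDE", 1002.4393),
    ("PEPT[+203.079]IDE", 1002.4393),
]

groups = group_by_full_sequence(ids)
for sequence, masses in groups.items():
    # The target m/z depends only on the mass, not on how the label is written.
    print(sequence, len(masses), round(mz_from_mono_mass(masses[0], 2), 4))
```

In this sketch the three identifications share one mass and therefore one target m/z, but they fall into two groups because the label is written two ways. That is why the notation should stay the same across an ID file.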