compomics / searchgui

Highly adaptable common interface for proteomics search and de novo engines
http://compomics.github.io/projects/searchgui.html

Trypsin digestion: sequence does not end with "K" or "R" and is at the end of a protein sequence #365

Closed ATPs closed 8 months ago

ATPs commented 8 months ago

SearchGUI-4.2.17

I get a result like this:

Protein(s)                   protein126447,protein674101,protein870834,prot...
Sequence                                                          GDNITLLQSVSN
AAs Before                                                         LK;LK;PK;LK
AAs After                                                                  ;;;
Position                                                          80;40;67;110
Modified Sequence                                        NH2-GDNITLLQSVSN-COOH
Variable Modifications                                                     NaN
Fixed Modifications                                                        NaN
Spectrum File                                                       1302-DDA-6
Spectrum Title                  controllerType=0 controllerNumber=1 scan=35921
Spectrum Scan Number                                                       NaN
RT                                                                  4336.04496
m/z                                                                 630.827576
Measured Charge                                                             2+
Identification Charge                                                        2
Theoretical Mass                                                   1259.635745
Isotope Number                                                               0
Precursor m/z Error [ppm]                                             3.846943
Localization Confidence                                                    NaN
Probabilistic PTM score                                                    NaN
D-score                                                                    NaN
Confidence [%]                                                        98.78327
Validation                                                           Confident

I used Trypsin as the enzyme, with settings for modifications. Why does the sequence not end with "K" or "R"? Here, it ends with "N", the AAs Before are "LK" or "PK", and the AAs After are empty.

So does the "Protein(s)" field list any protein that contains the peptide sequence, regardless of whether the peptide fits the Trypsin digestion rule?
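
To make the digestion rule in question concrete, here is a minimal Python sketch that checks whether each protein mapping of the peptide above is consistent with trypsin cleavage. The function name `is_tryptic` is hypothetical, and the sketch rests on two assumptions that are not confirmed in this thread: an empty "AAs Before" or "AAs After" entry means the peptide sits at a protein terminus, and the proline exception to trypsin cleavage is ignored.

```python
def is_tryptic(aas_before: str, sequence: str, aas_after: str) -> bool:
    """Return True if both peptide termini are consistent with trypsin.

    Assumptions (not from the thread): an empty flanking string means the
    peptide is at a protein terminus; the proline rule is ignored.
    """
    # N-terminus: the residue just before the peptide must be K or R,
    # unless the peptide starts the protein.
    n_term_ok = aas_before == "" or aas_before[-1] in "KR"
    # C-terminus: the peptide's last residue must be K or R,
    # unless the peptide ends the protein.
    c_term_ok = aas_after == "" or sequence[-1] in "KR"
    return n_term_ok and c_term_ok


# Values copied from the export above; one entry per mapped protein.
before = "LK;LK;PK;LK".split(";")
after = ";;;".split(";")
results = [is_tryptic(b, "GDNITLLQSVSN", a) for b, a in zip(before, after)]
print(results)
```

Under these assumptions, every mapping is preceded by "K" and has no residues after it, so the peptide would count as a protein C-terminal tryptic peptide even though it ends with "N". If the empty "AAs After" field means something else in this export, the conclusion does not hold.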

ATPs commented 8 months ago

I guess it is solved... It is a feature of PeptideShaker...

hbarsnes commented 8 months ago

> I guess it is solved... It is a feature of PeptideShaker...

This behavior depends on the search engines used. For example, X! Tandem will map such peptides in its second pass search. Does this match come from X! Tandem?