HUPO-PSI / mzIdentML

Repository for mzIdentML and the corresponding examples

Mod scores - problem reporting a combined score for multiple sites #112

Open andrewrobertjones opened 5 years ago

andrewrobertjones commented 5 years ago

Ville Koskinen is working on mzIdentML 1.2 export from Mascot and reports this issue:

We're looking at what changes are needed to export mzIdentML 1.2 from Mascot Server. One problem we've encountered so far is with site localisation scores. Here's an example query with two modifications and multiple permutations (scroll to the bottom of the page):

http://www.matrixscience.com/cgi/peptide_view.pl?file=..%2Fdata%2F20190108%2FF001291.dat;_msresflags=3138;_msresflags2=266;ave_thresh=17;db_idx=2;hit=1;index=TRAP_PLAFA;px=1;query=8118;section=5

The confidence percentages are based on the score difference between adjacent ranks. There is no site-specific score; the percentage is the joint assignment confidence for Oxidation and dHex(1)Hex(1).

1) One possibility is to have the same modification index for different elements. For example:

`TASCGVWDEWSPCSVTCGK`

Then, under , the line referring to modification index 1 means a simultaneous assignment. We could then encode the 10 different permutations like:

` ... `

This is assuming the site alternation is in the same order as the elements (oxidation|dhex). Is this allowed, and is this the intended encoding?

2) If there must be a one-to-one mapping between modification index and modification name/delta, another possibility is to extend the regular expression constraint. Maybe something like this would work?

``

The least ambiguous syntax would be the following, but I realise this is a big departure from the existing syntax:

``

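
To make option 1 concrete, here is a rough Python sketch of how a reader might interpret a shared modification index. It assumes a colon-separated value of the form `index:score:positions:flag`, with positions separated by `|`; that layout should be checked against the specification. The function names, the score 0.62 and the positions 7 and 3 are hypothetical and are not taken from the Mascot result above.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class SiteScore:
    mod_index: int
    score: float
    positions: List[int]
    passed: bool


def parse_site_score(value: str) -> SiteScore:
    """Split an assumed 'index:score:pos|pos:flag' string into its fields."""
    parts = value.split(":")
    if len(parts) != 4:
        raise ValueError(f"expected 4 colon-separated fields, got {len(parts)}")
    index, score, positions, flag = parts
    if flag not in ("true", "false"):
        raise ValueError(f"flag must be 'true' or 'false', got {flag!r}")
    return SiteScore(
        mod_index=int(index),
        score=float(score),
        positions=[int(p) for p in positions.split("|")],
        passed=(flag == "true"),
    )


def joint_assignment(site: SiteScore, mods_by_index: Dict[int, List[str]]) -> Dict[str, int]:
    """Option 1 reading: positions are paired with the modifications that
    share the index, in the order the modification elements appear."""
    names = mods_by_index[site.mod_index]
    if len(names) != len(site.positions):
        raise ValueError("number of positions does not match number of modifications")
    return dict(zip(names, site.positions))


if __name__ == "__main__":
    # Hypothetical: both modifications carry modification index 1.
    shared = {1: ["Oxidation", "dHex(1)Hex(1)"]}
    # Hypothetical score and positions for one of the permutations.
    site = parse_site_score("1:0.62:7|3:true")
    print(joint_assignment(site, shared))
```

The length check in `joint_assignment` shows where this reading is fragile: a consumer that treats `|` as a list of alternative sites for a single modification would parse the same string without error but interpret it differently.
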
vrkosk commented 2 years ago

What mzIdentML software is able to read and display localisation scores using the 1.2.0 syntax? Is there any?