OpenMS / OpenMS

The codebase of the OpenMS project
https://www.openms.de
Other
477 stars 314 forks source link

Need help for identifying the input features for percolator #6693

Open katrina-uw opened 1 year ago

katrina-uw commented 1 year ago

I'm trying to find out how to map these features back to the percolator: percolator_features

For example, the PSM outputs from the percolator adaptor are shown below: psm_features There are many features above where the name is some id. I don't know the meaning behind since there are no descriptions..

Could you help me to check whether the features in the first figure can be found in the PSM output file (second figure), and what's the id of these features?

Thank you so much!

timosachsenberg commented 1 year ago

Hi The set of features differ between search engines and sometimes even search engine versions. So don’t expect the exact features as in the publication.

you can look up the meaning of the cv terms here https://github.com/HUPO-PSI/psi-ms-CV/blob/master/psi-ms.obo .

In your input data the features written by the search engine are the ones that start with COMET.

timosachsenberg commented 1 year ago

Feel free to join our discord channel to discuss with other developers what you want to achieve with OpenMS https://discord.gg/4TAGhqJ7s5 . Sometimes there might already been scripts etc. out there by others that could simplify things for you and people are usually happy to share.

jpfeuffer commented 1 year ago

Yes the MS:XXXXXX scores are usually coming from Percolator itself (i.e. output). You can check them in the ontology. The input features specific for Comet are defined here: https://github.com/OpenMS/OpenMS/blob/develop/src/openms/source/ANALYSIS/ID/PercolatorFeatureSetHelper.cpp#L179

No matter which search engine also basic features like m/z, RT, charge are added, IIRC.

I think they are pretty stable but as Timo said, no guarantee that they were/will be all the same for older/newer versions.