Closed apulvino closed 6 months ago
Nope. The pcgroup number doesn't hold an significance itself. All it says is that features with the same pcgroup are probably from the same molecule based on the correlation analysis the functions have done.
Is there a useful tool which can handle annotating these data with metabolite common names?
Not in any magic way. Identification in metabolomics is a large subject that you cannot find a tool that just does. There are many complex tools that can help you though. Database matching can be done with https://github.com/rformassspectrometry/MetaboAnnotation. Typically that would be done with MS2 data though. But you could use the CAMERA groups as pseudo MS2 data.
Other complex tools include SIRIUS and the suite of GNPS tools.
do you know if it's possible to not provide ionization for every mz and intensity value you have for a run of the basic sirius command (ie. sirius [OPTIONS] -z
I am not using these tools myself so I cannot help you. I meant complex in the sense that what they do is a (internally) sophisticated process. I don't think they are that difficult to learn. But what is very important is that they are not trusted to provide truth. There is so much BS published where people use these tools without critical thinking about the likelihood of the results. Identification needs to be verified by old fashioned methods, e.g. standards and considering the alternative structures that are as plausible as whatever top candidate you get. You won't find any magic tool that just spits out a list of names you can sensibly use without manually curation. Those that do that provide only noise to the literature. But this is getting far outside the scope of this issue tracker....
I appreciate the resolution! I have more of an NGS-focused data analysis background and so this was useful insight regarding the state of the field comparatively. The run I'm analyzing is targeted... Is there certain metadata about target standards that I can ask my core for so that I can run some sort of alignment algorithm using my experimental data?
Not sure if this is possible, or what kind of data I would be asking for in this case (in terms of format/what tool would allow me to run such an alignment).
Is the pcgroup ID/value is equivalent to ChemSpider database ID/value? I don't see clarification on whether this is the case or not in the CAMERA documentation and only allusions to it in the documentation from other packages... Thanks ahead for the clarification!