Closed jpfeuffer closed 1 year ago
The step can use the pyopenms container/conda
An example is here: mzml_dataframe_file_part I extracted information about MS1, MS2, MS3 (if exist) from mzMLs, where MS1 and MS3 only have information about whether they exist or not. What is done is to collect all mzMLs and extract information to make a dataframe, without statistics. The final csv file is passed to pmultiqc for processing.
Wouldn't it be smarter to have a column MSlevel that then has the values 1,2,3,... ?
It just takes too long during multiqc execution. Could be done in the mzML indexing step. You can write out a table:
spec_id, peaks, base int, ...