Open oolonek opened 1 month ago
Looking at the mzXML we observe a filechecksum at the end of the file
<fileChecksum>86c35a0d4acb405e1000492f54f9e0fa55cb353b</fileChecksum>
We might use this checksum to identify uniquely the analysis in the graph
In fact it appears that their is another filw checsum at the beginning
<sourceFileList count="1">
<sourceFile id="RAW1" name="20240307_EB_dbgi_001195_01_01.raw" location="file:///Y:\public\QE_plus_unifr\raw\2024\03">
<cvParam cvRef="MS" accession="MS:1000768" name="Thermo nativeID format" value=""/>
<cvParam cvRef="MS" accession="MS:1000563" name="Thermo RAW format" value=""/>
<cvParam cvRef="MS" accession="MS:1000569" name="SHA-1" value="9bb6474f5663680a302fdc55c65b620f422e9d61"/>
We should extract also this one in our metadata file
@edouardbruelhart So I just had a look and confirm that
<cvParam cvRef="MS" accession="MS:1000569" name="SHA-1" value="9bb6474f5663680a302fdc55c65b620f422e9d61"/>
<fileChecksum>86c35a0d4acb405e1000492f54f9e0fa55cb353b</fileChecksum>
We are thinking of transitionig from zthe "sample centric" to an "analysis centric" approaches. This means that there should no longer be
/pos
and/neg
subdirectories within the sample dir but rather one unique dir per analysis.Using matchms or pyteomics we would like to extract the following metadata from a .mzML file
We are thinking of transitionig from zthe "sample centric" to an "analysis centric" approaches. This means that there should no longer be
/pos
and/neg
subdirectories within the sample dir but rather one unique dir per analysis.Using matchms or pyteomics we would like to extract the following metadata from a .mzML file
id
name
location
cvParam
accession
name
value
software
software
id
version
e.g. Xcalibur 2.9-290033/2.9.0.2926hardware
e.g. Q Exactive Plus
For each value, when possible we retrieve the associated MS: ontology indentifiers
analysis related measure
[ ] file checksum
[x] Number of scans