Closed jorainer closed 1 year ago
Thanks for spotting. That is from an ancient perl script extracting spectral data from ACD SpecManager database. The duplication might be a result of merging FRAGMENTATION_MODE
and FRAGMENTATION_METHOD
Yours, Steffen
Are you planning to fix that and provide an updated 2022.12 release? just to know if I should make an intermediate fix or wait for the official fix...
I will not fix existing releases, but rather release fixed version with new version number. If you can easily fix that for your dataset right now, then please do it. In addition to fixing this particular problem, I would also like to implement a automatic test which identifies similar problems for existing data and future contributions.
Thanks @meier-rene for the update - do you know already when you will release the next version?
Asap, but first I would like to release the software stack. I would guess end of the week I might be done with fixing data.
The issue is fixed with 43e98fbf14 in dev. The issue you reported was the only one of that kind. Could not find any other duplicates. I want to wait for the answer of an contributor before I make a new data release. Release will be very soon.
Perfect! Thanks! and yes, I also checked all records and these were the only ones with duplicates.
Data is ready to be released, but I cant do it. The merge to main branch requires the successful report from the CI pipeline. Unfortunately the maven repo, from which we pull the SPLASH library is down. Nothing I can do to fix that. I would like to wait a little bit, before I make major changes to build infrastructure...
All good - just post here (or even better close the issue) once the data is released so I get notified automatically.
Solved with 2012.12.1 release. Thanks for reporting!
I stumbled across some inconsistencies in the MassBank
AC_MASS_SPECTROMETRY
table: there are 21 spectra (records) that have a duplicatedFRAGMENTATION_MODE
SUBTYPE
:Example:
there is twice the
FRAGMENTATION_MODE
"CID"
listed for this spectrum.This happens for in total 21 records:
would be nice if that could be fixed in the 2022.12 release as this causes errors in my scripts to query the MassBank database (where I expect only a single type of fragmentation mode per spectrum).