MassBank / MassBank-web

The web server application and directly connected components for a MassBank web server

Change to PUBLICATION tag in record format #157

Open meier-rene opened 5 years ago

meier-rene commented 5 years ago

I would like to make a minor change to the PUBLICATION tag in the record format. At the moment, some records have several publication references in one PUBLICATION tag. This makes automatic processing of this information cumbersome, and I would like to change it to one PUBLICATION tag per reference, as we already have for CH$NAME or COMMENT. Because this introduces changes to the record format, I'm putting this issue up for discussion. If you see any reasons why this shouldn't be done, please comment.
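To illustrate why one tag per reference is easier to process, here is a minimal sketch of a reader that collects repeated tags into lists. It assumes each record line has the form `TAG: value`; the function name `parse_record`, the accession and the two publication strings are hypothetical and are not taken from the MassBank code base or from real records.

```python
from collections import defaultdict


def parse_record(text):
    """Collect the values of each tag; repeated tags accumulate in file order."""
    fields = defaultdict(list)
    for line in text.splitlines():
        # Assumption: tag and value are separated by the first ": " on the line.
        if ": " not in line:
            continue
        tag, value = line.split(": ", 1)
        fields[tag].append(value.strip())
    return dict(fields)


# Hypothetical record fragment with one PUBLICATION tag per reference.
record = """\
ACCESSION: XX000001
CH$NAME: Caffeine
CH$NAME: 1,3,7-Trimethylxanthine
PUBLICATION: Example A. et al. Hypothetical Journal 1 (2020)
PUBLICATION: Example B. et al. Hypothetical Journal 2 (2021)
"""

fields = parse_record(record)
for reference in fields["PUBLICATION"]:
    print(reference)
```

With several references packed into a single PUBLICATION value, the same reader would need an additional, error-prone splitting step, since no delimiter between references is defined.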

tsufz commented 3 years ago

@meier-rene, for the dataset object, we have a schema (https://developers.google.com/search/docs/data-types/dataset). This is less challenging than the creator object (#194).

sneumann commented 3 years ago

Why not give PUBLICATION a 0..N multiplicity, i.e. one tag per publication? A difficulty will then be how to align those with the AUTHOR tag, because that depends on line ordering and hierarchical relationships between lines, which our record format currently does not handle well.
The mzTab format handles this by numbering entries (see contact[2]-email and publication[1]) in

https://github.com/HUPO-PSI/mzTab/blob/master/examples/2_0-Metabolomics_Release/StandardMix_negative_exportSpeciesLevel.mzTab#L9

as specified in https://github.com/HUPO-PSI/mzTab/blob/master/specification_document-releases/2_0-Metabolomics-Release/mzTab_format_specification_2_0-M_release.adoc#6212-publication1-n
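A rough sketch of how such numbered keys could be grouped, so that all fields sharing an index end up in one entry. It only assumes keys shaped like `name[index]` or `name[index]-subfield`, as in the `contact[2]-email` and `publication[1]` examples; the function `group_indexed`, the regular expression and all values are hypothetical, and this is not an mzTab parser.

```python
import re

# Matches keys such as "publication[1]" or "contact[2]-email".
KEY = re.compile(r"^(?P<name>[a-z_]+)\[(?P<index>\d+)\](?:-(?P<sub>.+))?$")


def group_indexed(pairs):
    """Group (key, value) pairs by name and index: {name: {index: {subfield: value}}}."""
    grouped = {}
    for key, value in pairs:
        match = KEY.match(key)
        if match is None:
            continue  # not an indexed key, ignored in this sketch
        entry = grouped.setdefault(match["name"], {}).setdefault(int(match["index"]), {})
        # Keys without a subfield are stored under the placeholder name "value".
        entry[match["sub"] or "value"] = value
    return grouped


# Hypothetical metadata pairs; the values are placeholders, not real data.
pairs = [
    ("contact[1]-name", "Hypothetical Person A"),
    ("contact[2]-name", "Hypothetical Person B"),
    ("contact[2]-email", "b@example.org"),
    ("publication[1]", "doi:10.0000/hypothetical"),
]

grouped = group_indexed(pairs)
print(grouped["contact"][2])
```

Because the index is part of the key, the grouping does not depend on line order, which is the property the record format currently lacks.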

Yours, Steffen