Closed by marekhorst 3 weeks ago
As a follow-up to this task, the PMC cache should be updated by dropping the most recent update, which includes the outcome of Springer records parsing, and rerunning the PMC ingestion with cache update enabled (e.g. as part of the IIS primary job).
The fix for #1464 became part of the #1466 fix and was introduced with this commit: https://github.com/openaire/iis/commit/a8f5a302877d9241e94d3a6f67c364e19cddc7cf
Originally requested in Redmine: https://support.openaire.eu/issues/9982.
After running the JATS ingester module (originally prepared to handle JATS records coming from PubMed) on Springer JATS records, it turned out that affiliations are not correctly linked to authors. As reported by Miriam in #9976#note-3:
I was able to reproduce this behavior in a dedicated test case, which revealed a slightly different contributor encoding involving multiple layers of nested `contrib-group`, `contrib` and `collab` elements. This structure was not properly handled by the `ArticleMetaXmlHandler`.

Additionally, it was discovered that an author name was also broken by the nested structure of contributors: the institution name from the parent contributor was glued to the first child contributor's name. This also needs to be fixed.
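
To illustrate the kind of nesting described above, here is a minimal Python sketch (not the actual IIS Java code and not the fix from the linked commit). The XML fragment is a made-up example of a `collab` contributor wrapping a nested `contrib-group`; it is not taken from a real Springer record. `NestedContribHandler`, `parse_contributors`, `FIELDS` and the dictionary keys are hypothetical names chosen for this illustration. The idea is to keep one frame per open `contrib` on a stack and to reset the text buffer at every element boundary, so that the parent's institution name and the children's names and affiliation references cannot bleed into each other.

```python
import xml.sax

# Hypothetical fragment (an assumption for illustration, not a real record):
# a parent contributor holding a collab with its own nested contrib-group.
SAMPLE = """<article-meta>
  <contrib-group>
    <contrib contrib-type="author">
      <collab>
        <institution>Example Consortium</institution>
        <contrib-group>
          <contrib contrib-type="author">
            <name><surname>Smith</surname><given-names>Anna</given-names></name>
            <xref ref-type="aff" rid="Aff1"/>
          </contrib>
          <contrib contrib-type="author">
            <name><surname>Jones</surname><given-names>Bob</given-names></name>
            <xref ref-type="aff" rid="Aff2"/>
          </contrib>
        </contrib-group>
      </collab>
    </contrib>
  </contrib-group>
</article-meta>"""

# Leaf elements whose text is stored on the innermost open contributor.
FIELDS = {"surname": "surname", "given-names": "given_names", "institution": "collab"}


class NestedContribHandler(xml.sax.ContentHandler):
    """Collects contributors while tracking contrib nesting with a stack."""

    def __init__(self):
        super().__init__()
        self.contributors = []  # contributors in document order
        self._stack = []        # currently open contrib frames, innermost last
        self._text = []         # character chunks of the current element

    def startElement(self, name, attrs):
        # Reset the buffer so text from a previous element is never carried over.
        self._text = []
        if name == "contrib":
            frame = {"surname": None, "given_names": None, "collab": None, "aff_ids": []}
            self.contributors.append(frame)
            self._stack.append(frame)
        elif name == "xref" and self._stack and attrs.get("ref-type") == "aff":
            # Attach the affiliation reference to the innermost open contributor.
            self._stack[-1]["aff_ids"].append(attrs.get("rid"))

    def characters(self, content):
        # SAX may deliver text in several chunks, so accumulate them.
        self._text.append(content)

    def endElement(self, name):
        value = "".join(self._text).strip()
        self._text = []
        if name == "contrib":
            self._stack.pop()
        elif self._stack and name in FIELDS:
            self._stack[-1][FIELDS[name]] = value


def parse_contributors(xml_text):
    handler = NestedContribHandler()
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return handler.contributors
```

Calling `parse_contributors(SAMPLE)` yields three separate entries: the parent contributor carrying only the institution name, followed by the two nested authors, each with its own name and affiliation reference. A handler that keeps a single "current contributor" and a single shared text buffer would instead be prone to exactly the symptoms reported here.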