Open ckoscher opened 6 years ago
As per sprint planning meeting, we will provide plain title and inline XML title versions.
Jennifer Lin [Administrator] added a comment - 11/Oct/17 10:17 PM
Note that we've had a user return asking for update on this (>1 year old) issue.
We will create an entry that preserves the XML for all records across all content types, regardless of whether it has markup in the title. The content for this entry will duplicate the as-deposited XML title into a JSON field called 'title-xml'. This field will be part of the JSON document but not an indexed field in Solr. The field will contain XML text as follows: a) content must have the following opening tag:
<title xmlns="http://www.crossref.org/xschema/1.1" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:mml="http://www.w3.org/1998/Math/MathML" xsi:schemaLocation="http://www.crossref.org/xschema/1.1 http://doi.crossref.org/schemas/unixref1.1.xsd">
b) content will have the closing tag:
</title>
c) contents between opening and closing tag will be the contents of the
“title-xml”: [“<title xmlns="http://www.crossref.org/xschema/1.1" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:mml="http://www.w3.org/1998/Math/MathML" xsi:schemaLocation="http://www.crossref.org/xschema/1.1 http://doi.crossref.org/schemas/unixref1.1.xsd">NeisseriaBase: a specialised\n <i>Neisseria</i>\n genomic resource and analysis platform</title>”]
Multi-title example: Input
<titles>
<title>
NeisseriaBase: a specialised
<i>Neisseria</i>
genomic resource and analysis platform
</title>
<title>My paper</title>
</titles>
Output
"title": ["NeisseriaBase: a specialised Neisseria genomic resource and analysis platform", “My paper”],
"title-xml": [
“<title xmlns=\"http://www.crossref.org/xschema/1.1\" xmlns:xsi=\"http://www.w3.org/2001/XMLSchema-instance\" xmlns:mml=\"http://www.w3.org/1998/Math/MathML\" xsi:schemaLocation=\"http://www.crossref.org/xschema/1.1 http://doi.crossref.org/schemas/unixref1.1.xsd\">
NeisseriaBase: a specialised\n<i>Neisseria</i>\n genomic resource and analysis platform</title>”,
“<title xmlns=\"http://www.crossref.org/xschema/1.1\" xmlns:xsi=\"http://www.w3.org/2001/XMLSchema-instance\" xmlns:mml=\"http://www.w3.org/1998/Math/MathML\" xsi:schemaLocation=\"http://www.crossref.org/xschema/1.1 http://doi.crossref.org/schemas/unixref1.1.xsd\">
My paper </title>”
]
Looks good.
Update: @ckoscher had a convo with member and new solution now in the horizon: We will get the XSLT to format the mathml tech string. This will be run as part of the deposit processing so all downstream outputs will be sorted. Result will retain markdown so that systems can properly represent the characters.
CS-3845
=== From GO-312
I'm not clear on how MathML should be handled in JSON output (if at all) but APS has noted that in the JSON output,MathML elements are just appearing as separate elements. This is causing display issues downstream.
For example:
http://api.crossref.org/works/10.1103/PhysRevLett.112.102502
title appears as:
β Decay of Ca 38 : Sensitive test of Isospin Symmetry-Breaking Corrections from Mirror Superallowed 0 + → 0 + Transitions
deposited as: