brucemiller / LaTeXML

LaTeXML: a TeX and LaTeX to XML/HTML/ePub/MathML translator.
http://dlmf.nist.gov/LaTeXML/
Other
957 stars 101 forks source link

JATS: Section number appears within section title text #2345

Open castedo opened 7 months ago

castedo commented 7 months ago

REPRO STEPS:

With doc.tex.txt:

latexmlc --format=jats doc.tex.txt

RESULT:

...

<sec id="S1">
      <title>1 Introduction</title>
...
    <sec id="S2">
      <title>2 Conclusion</title>
...

EXPECTED:

...

<sec id="S1">
      <title>Introduction</title>
...
    <sec id="S2">
      <title>Conclusion</title>
...

VERSION:

# latexmlc -V       
latexmlc (LaTeXML version 0.8.8)

DISCUSSION: The current LaTeXML output differs from pandoc. I wager most tools will not insert section numbering into JATS. I believe most JATS pipelines try to avoid having styling and special formatting inside JATS and instead add those later in a pipeline.

dginev commented 7 months ago

Thank you for making a JATS audit of latexml @castedo - the new issues you have opened are very helpful. Feel free to open more - I will make a new label and tag them.

Looking at the sec documentation, I agree with the report here. I also wanted to attach a JATS sample from PubMed Central here, as another kind of high quality reference. Many more of these are available here

PMC4053489.zip