arXiv / html_feedback

Supports a student project developing a UI for feedback on arXiv articles rendered as html.
MIT License
18 stars 3 forks source link

LaTeX commands quoted inside text of paper impact HTML output #2105

Open EHCliffe opened 1 month ago

EHCliffe commented 1 month ago

Description

In this paper (ironically "HTML papers on arXiv: why it’s important, and how we made it happen") the commands \section and \paragraph are within the text of the paper. Rather than becoming part of the text of the HTML it appears that at least one has ended up being interpreted as part of the structure of the LaTeX document. There is an untitled section 0.5 which corresponds to the position of the \section command inside the text. The \paragraph command seems to produce a level of confusion. Though the content of the lost text (involving \section but also html tags quoted in the text) could also be the cause. Some of the text is lost as a result.

Impacts: Confusing structure Some text is lost making the remainder confusing to read - the quantity of text lost isn't apparent without looking at the PDF.

(Optional:) Please add any files, screenshots, or other information here.

image

and the PDF covering the same part of the text: image

(Required) What is this issue most closely related to? Select one.

Choose One

Internal issue ID

72fb5729-476b-4052-9f9a-e654d4a8e8d2

Paper URL

https://arxiv.org/html/2402.08954v1

Browser

Chrome/128.0.0.0

Device Type

Windows desktop

github-actions[bot] commented 1 month ago

Hello @EHCliffe, thanks for the issue report! We are reviewing your report and will address it as soon as possible.

dginev commented 1 month ago

I may have mentioned that easychair.cls needs LaTeXML support when that paper was being submitted, and it is just as true today as it was then.

Thank you for the report!