arXiv / html_feedback

Supports a student project developing a UI for feedback on arXiv articles rendered as html.
MIT License
18 stars 3 forks source link

Rendering of figures with LaTeX boxes does not work right #2575

Open tygert opened 1 day ago

tygert commented 1 day ago

Description

The layout of the pages with figures containing LaTeX boxes is not rendering right. The proper rendering is available in the PDF file.

(Optional:) Please add any files, screenshots, or other information here.

No response

(Required) What is this issue most closely related to? Select one.

Figures

Internal issue ID

c17253d8-e9c6-4811-8afa-5959523cee34

Paper URL

https://services.arxiv.org/html/submission/5993144/view

Browser

Firefox/132.0

Device Type

Desktop

html-feedback-bot[bot] commented 1 day ago

Location in document: ltx_caption

Selected HTML:

Refer to caption

m=𝑚absentm=italic_m = 396,326 (with =absent\ell=roman_ℓ = 3,985 distinct scores)
n=𝑛absentn=italic_n = 3,985
Kuiper’s statistic =0.001629/σ=2.456absent0.001629𝜎2.456=0.001629/\sigma=2.456= 0.001629 / italic_σ = 2.456; the asymptotic P-value =0.05622absent0.05622=0.05622= 0.05622
Kolmogorov-Smirnov’s =0.001132/σ=1.707absent0.001132𝜎1.707=0.001132/\sigma=1.707= 0.001132 / italic_σ = 1.707; asymptotic P-value =0.1755absent0.1755=0.1755= 0.1755
ATE =0.0004812/σ=0.7254absent0.0004812𝜎0.7254=0.0004812/\sigma=0.7254= 0.0004812 / italic_σ = 0.7254

(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Refer to caption
(e)
Figure 3: Have angina or coronary heart disease compared with having had a heart attack, versus BMI. Discerning the range of lowest scores (that is, the BMIs) over which the cumulative plot drops steeply is difficult in all but the reliability diagram with the greatest number of bins. And, unfortunately, the reliability diagram with the greatest number of bins is very noisy. The reliability diagrams also look inconsistent around BMI of 41 (the steep incline in the cumulative graph explains why).
Refer to caption
(a)

m=𝑚absentm=italic_m = 396,326 (with =absent\ell=roman_ℓ = 3,985 distinct scores)
n=𝑛absentn=italic_n = 3,985
Kuiper’s statistic =0.002511/σ=3.052absent0.002511𝜎3.052=0.002511/\sigma=3.052= 0.002511 / italic_σ = 3.052; the asymptotic P-value =0.009106absent0.009106=0.009106= 0.009106
Kolmogorov-Smirnov’s =0.001659/σ=2.015absent0.001659𝜎2.015=0.001659/\sigma=2.015= 0.001659 / italic_σ = 2.015; asymptotic P-value =0.08773absent0.08773=0.08773= 0.08773
ATE =0.0008138/σ=0.9889absent0.0008138𝜎0.9889=-0.0008138/\sigma=-0.9889= - 0.0008138 / italic_σ = - 0.9889


(b)
Refer to caption
(c)
Refer to caption
(d)
Refer to caption
(e)
Refer to caption
(f)
Figure 4: Have had a stroke compared with having kidney disease, versus BMI. As with all Figures 38, the scores are the survey participants’ BMIs. Discerning that the expected difference between having had a stroke and having kidney disease is roughly constant for scores greater than 35 is really hard from the reliability diagrams, due to noise when there are many bins or insufficient resolution when there are few bins.
Refer to caption
(a)

m=𝑚absentm=italic_m = 396,326 (with =absent\ell=roman_ℓ = 3,985 distinct scores)
n0=subscript𝑛0absentn_{0}=italic_n start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 23,035
n=𝑛absentn=italic_n = 3,985
Kuiper’s statistic =0.01831/σ=4.083absent0.01831𝜎4.083=0.01831/\sigma=4.083= 0.01831 / italic_σ = 4.083; the asymptotic P-value =0.0001781absent0.0001781=0.0001781= 0.0001781
Kolmogorov-Smirnov’s =0.01829/σ=4.078absent0.01829𝜎4.078=0.01829/\sigma=4.078= 0.01829 / italic_σ = 4.078; asymptotic P-value =0.0000908absent0.0000908=0.0000908= 0.0000908
ATE =0.01821/σ=4.061absent0.01821𝜎4.061=0.01821/\sigma=4.061= 0.01821 / italic_σ = 4.061

(b)
Refer to caption
(c)
Refer to caption
(d)
Refer to caption
(e)
Refer to caption
(f)
Figure 5: Could afford to see a doctor (rather than could not) versus BMI for those who have had a heart attack compared with the full population (the full population includes both those who have had a heart attack and those who have not). There seems to be a Simpson’s Paradox in the reliability diagrams for BMI around 30. Determining the size of the difference for BMI less than 20 is impossible from any but the noisiest of the reliability diagrams, whereas the difference is easy to quantify from the slope of the cumulative graph.
Refer to caption
(a)

m=𝑚absentm=italic_m = 396,326 (with 3,985 distinct scores prior to randomization)
n0=subscript𝑛0absentn_{0}=italic_n start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 121,203
n1=subscript𝑛1absentn_{1}=italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 232,734
n=𝑛absentn=italic_n = 79,220
Kuiper’s statistic =0.002595/σ=4.635absent0.002595𝜎4.635=0.002595/\sigma=4.635= 0.002595 / italic_σ = 4.635; the asymptotic P-value =0.000014absent0.000014=0.000014= 0.000014
Kolmogorov-Smirnov’s =0.001460/σ=2.607absent0.001460𝜎2.607=0.001460/\sigma=2.607= 0.001460 / italic_σ = 2.607; asymptotic P-value =0.01827absent0.01827=0.01827= 0.01827
ATE =0.001484/σ=2.650absent0.001484𝜎2.650=-0.001484/\sigma=-2.650= - 0.001484 / italic_σ = - 2.650 (or 0.001711/σ0.001711𝜎-0.001711/\sigma- 0.001711 / italic_σ =3.056absent3.056=-3.056= - 3.056 after having averaged over 25 random infinitesimal perturbations of the original scores)

(b)
Refer to caption
(c)
Refer to caption
(d)
Refer to caption
(e)
Refer to caption
(f)
Figure 6: Have kidney disease (rather than do not) versus BMI for those tested for HIV compared with those not tested. Making sense for scores less than 20 of the large difference between those tested and those not tested is very difficult using only the reliability diagrams, whereas the cumulative plot is crystal clear.
Refer to caption
(a)

m=𝑚absentm=italic_m = 396,326 (with 3,985 distinct scores prior to randomization)
n0=subscript𝑛0absentn_{0}=italic_n start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 121,203
n1=subscript𝑛1absentn_{1}=italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 232,734
n=𝑛absentn=italic_n = 79,533
Kuiper’s statistic =0.002713/σ=4.713absent0.002713𝜎4.713=0.002713/\sigma=4.713= 0.002713 / italic_σ = 4.713; the asymptotic P-value =0.0000097absent0.0000097=0.0000097= 0.0000097
Kolmogorov-Smirnov’s =0.001892/σ=3.286absent0.001892𝜎3.286=0.001892/\sigma=3.286= 0.001892 / italic_σ = 3.286; asymptotic P-value =0.002034absent0.002034=0.002034= 0.002034
ATE =0.002477/σ=4.303absent0.002477𝜎4.303=-0.002477/\sigma=-4.303= - 0.002477 / italic_σ = - 4.303 (or 0.001586/σ0.001586𝜎-0.001586/\sigma- 0.001586 / italic_σ =2.756absent2.756=-2.756= - 2.756 after having averaged over 25 random infinitesimal perturbations of the original scores)

(b)
Refer to caption
(c)
Refer to caption
(d)
Refer to caption
(e)
Refer to caption
(f)
Figure 7: Have kidney disease (rather than do not) versus BMI for those tested for HIV compared with those not tested, with scores randomized from a different random seed (namely, 54321) than that for Figure 6 (which used 543216789). As in Figure 6, making sense for scores below 20 of the big difference between those tested and those not tested is hard using only the reliability diagrams, whereas the cumulative plots are clear.
Refer to caption
(a)

m=𝑚absentm=italic_m = 396,326 (with 3,985 distinct scores prior to randomization)
n0=subscript𝑛0absentn_{0}=italic_n start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 121,203
n1=subscript𝑛1absentn_{1}=italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 232,734
n=𝑛absentn=italic_n = 79,551
Kuiper’s statistic =0.003108/σ=5.214absent0.003108𝜎5.214=0.003108/\sigma=5.214= 0.003108 / italic_σ = 5.214; the asymptotic P-value =0.0000007absent0.0000007=0.0000007= 0.0000007
Kolmogorov-Smirnov’s =0.001828/σ=3.066absent0.001828𝜎3.066=0.001828/\sigma=3.066= 0.001828 / italic_σ = 3.066; asymptotic P-value =0.004335absent0.004335=0.004335= 0.004335
ATE =0.001858/σ=3.117absent0.001858𝜎3.117=-0.001858/\sigma=-3.117= - 0.001858 / italic_σ = - 3.117 (or 0.001662/σ0.001662𝜎-0.001662/\sigma- 0.001662 / italic_σ =2.788absent2.788=-2.788= - 2.788 after having averaged over 25 random infinitesimal perturbations of the original scores)

(b)
Refer to caption
(c)
Refer to caption
(d)
Refer to caption
(e)
Refer to caption
(f)
Figure 8: Have kidney disease (rather than do not) versus BMI for those tested for HIV compared with those not tested, with scores randomized from a different random seed (namely, 6789) than that for Figures 6 and 7. The cumulative plot here is as clear as in Figures 6 and 7, whereas making sense for scores less than 20 of the large difference between those tested and those not tested is hard using only the reliability diagrams.
Refer to caption
(a)

m=𝑚absentm=italic_m = 396,326 (with 105 distinct scores prior to randomization)
n0=subscript𝑛0absentn_{0}=italic_n start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 193,659
n1=subscript𝑛1absentn_{1}=italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 202,667
n=𝑛absentn=italic_n = 45,370
Kuiper’s statistic =0.5027/σ=19.88absent0.5027𝜎19.88=0.5027/\sigma=19.88= 0.5027 / italic_σ = 19.88; the asymptotic P-value is less than 1016superscript101610^{-16}10 start_POSTSUPERSCRIPT - 16 end_POSTSUPERSCRIPT
Kolmogorov-Smirnov’s =0.5021/σ=19.86absent0.5021𝜎19.86=0.5021/\sigma=19.86= 0.5021 / italic_σ = 19.86; the asymptotic P-value is less than 1016superscript101610^{-16}10 start_POSTSUPERSCRIPT - 16 end_POSTSUPERSCRIPT
ATE =0.9044/σ=35.77absent0.9044𝜎35.77=0.9044/\sigma=35.77= 0.9044 / italic_σ = 35.77 (or 0.8513/σ=33.670.8513𝜎33.670.8513/\sigma=33.670.8513 / italic_σ = 33.67 following averaging over 25 random infinitesimal perturbations of the original scores)

(b)
Refer to caption
(c)
Refer to caption
(d)
Refer to caption
(e)
Refer to caption
(f)
Refer to caption
(g)
Refer to caption
(h)
Figure 9: BMI versus height in centimeters for men compared with women. The scores for Figures 911 are heights instead of the BMIs used as scores for the earlier figures; BMIs are still used here, but now as responses rather than scores. Of course, height need not “cause” the associated BMI, but a causal connection seems more plausible with BMI depending on height rather than BMI “causing” height. Much higher BMI for men than for women who are equally very short jumps out of the cumulative plots. Assessing how much higher is trivial from the slope of the cumulative graph yet very tricky to divine from the reliability diagrams.
Refer to caption
(a)

m=𝑚absentm=italic_m = 396,326 (with 105 distinct scores prior to randomization)
n0=subscript𝑛0absentn_{0}=italic_n start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 193,659
n1=subscript𝑛1absentn_{1}=italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 202,667
n=𝑛absentn=italic_n = 45,489
Kuiper’s statistic =0.5165/σ=20.29absent0.5165𝜎20.29=0.5165/\sigma=20.29= 0.5165 / italic_σ = 20.29; the asymptotic P-value is less than 1016superscript101610^{-16}10 start_POSTSUPERSCRIPT - 16 end_POSTSUPERSCRIPT
Kolmogorov-Smirnov’s =0.5165/σ=20.29absent0.5165𝜎20.29=0.5165/\sigma=20.29= 0.5165 / italic_σ = 20.29; the asymptotic P-value is less than 1016superscript101610^{-16}10 start_POSTSUPERSCRIPT - 16 end_POSTSUPERSCRIPT
ATE =0.8183/σ=32.15absent0.8183𝜎32.15=0.8183/\sigma=32.15= 0.8183 / italic_σ = 32.15 (or 0.8603/σ=33.800.8603𝜎33.800.8603/\sigma=33.800.8603 / italic_σ = 33.80 following averaging over 25 random infinitesimal perturbations of the original scores)

(b)
Refer to caption
(c)
Refer to caption
(d)
Refer to caption
(e)
Refer to caption
(f)
Refer to caption
(g)
Refer to caption
(h)
Figure 10: BMI versus height in centimeters for men compared with women, with scores perturbed at random starting with a random seed, 54321, that is different from the seed used in Figure 9 (which was 543216789). The scores for these figures are heights, instead of the BMIs used as scores for the earlier figures; here, BMIs are responses rather than scores. As in Figure 9, the cumulative plot readily reveals much higher BMI for men than for women who report to be extremely short. The slope of the cumulative graph quantifies how much higher, whereas the reliability diagrams are hard to interpret for the very small scores.
Refer to caption
(a)

m=𝑚absentm=italic_m = 396,326 (with 105 distinct scores prior to randomization)
n0=subscript𝑛0absentn_{0}=italic_n start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 193,659
n1=subscript𝑛1absentn_{1}=italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 202,667
n=𝑛absentn=italic_n = 45,379
Kuiper’s statistic =0.4851/σ=18.62absent0.4851𝜎18.62=0.4851/\sigma=18.62= 0.4851 / italic_σ = 18.62; the asymptotic P-value is less than 1016superscript101610^{-16}10 start_POSTSUPERSCRIPT - 16 end_POSTSUPERSCRIPT
Kolmogorov-Smirnov’s =0.4851/σ=18.62absent0.4851𝜎18.62=0.4851/\sigma=18.62= 0.4851 / italic_σ = 18.62; the asymptotic P-value is less than 1016superscript101610^{-16}10 start_POSTSUPERSCRIPT - 16 end_POSTSUPERSCRIPT
ATE =0.8651/σ=33.21absent0.8651𝜎33.21=0.8651/\sigma=33.21= 0.8651 / italic_σ = 33.21 (or 0.8432/σ=32.360.8432𝜎32.360.8432/\sigma=32.360.8432 / italic_σ = 32.36 following averaging over 25 random infinitesimal perturbations of the original scores)

(b)
Refer to caption
(c)
Refer to caption
(d)
Refer to caption
(e)
Refer to caption
(f)
Refer to caption
(g)
Refer to caption
(h)
Figure 11: BMI versus height in centimeters for men compared with women, with scores perturbed at random using a different random seed (namely, 6789) than the seeds used in Figures 9 and 10. The scores for this figure and the other two are heights, instead of the BMIs used as scores for all other figures; BMIs are still used here, but as responses rather than scores. As in Figures 9 and 10, the cumulative plot reveals at a glance much higher BMI for men than for women who report being very, very short. The slope of the cumulative graph clearly quantifies how much higher, which is difficult to assess via the reliability diagrams.
github-actions[bot] commented 1 day ago

Hello @tygert, thanks for the issue report! We are reviewing your report and will address it as soon as possible.