arXiv / html_feedback

Supports a student project developing a UI for feedback on arXiv articles rendered as html.
MIT License

Figure 3 blank #1746

Open mjhoover1 opened 4 months ago

mjhoover1 commented 4 months ago

Description

Figure 3 doesn't appear for some reason.

(Optional:) Please add any files, screenshots, or other information here.

No response

(Required) What is this issue most closely related to? Select one.

Choose One

Internal issue ID

a6947939-b5de-4cc2-9506-45ab0f96b7d6

Paper URL

https://arxiv.org/html/2403.10094v1

Browser

Chrome/126.0.0.0

Device Type

Desktop

html-feedback-bot[bot] commented 4 months ago

Location in document: undefined

Selected HTML:

The range image presents LiDAR data compactly and intuitively, with rows indicating the laser beams and columns representing the yaw angles. We convert point clouds to range images using spherical projection. Typically, for a point $\mathbf{p}$ in Cartesian coordinates $(x,y,z)$, we calculate its spherical coordinates $(r,\theta,\phi)$ using:

$$r=\sqrt{x^{2}+y^{2}+z^{2}},\quad \theta=\operatorname{atan}(y,x),\quad \phi=\operatorname{atan}\left(z,\sqrt{x^{2}+y^{2}}\right). \tag{4}$$
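As a minimal sketch of Equation (4), assuming the two-argument `atan` in the excerpt means the quadrant-aware arctangent (`numpy.arctan2`) and that points arrive as an `(N, 3)` array; the function name `cartesian_to_spherical` is hypothetical and not taken from the paper:

```python
import numpy as np

def cartesian_to_spherical(points):
    """Map an (N, 3) array of (x, y, z) to range r, yaw theta, pitch phi."""
    points = np.asarray(points, dtype=float)
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    r = np.sqrt(x**2 + y**2 + z**2)             # distance from the origin
    theta = np.arctan2(y, x)                    # yaw angle in [-pi, pi]
    phi = np.arctan2(z, np.sqrt(x**2 + y**2))   # pitch angle in [-pi/2, pi/2]
    return r, theta, phi
```

This version measures every point from a single shared origin, which is exactly the assumption the next paragraph of the excerpt says does not hold for KITTI-360.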

However, in most current datasets such as KITTI-360, multiple lasers from the Velodyne LiDAR system do not share a common origin for their measurements. This may introduce errors in the direct conversion from Cartesian points to spherical points, resulting in incorrect range-view data distribution and thus low-quality range images, as shown at the top of Fig. 3.

To address this issue, we adopt Hough Voting to estimate heights and pitch angles $\{h_{j},\phi_{j}\}_{j=1,\ldots,N}$ for Velodyne sensors [3]. We then adjust the point cloud transformation to a range image using

$$r=\sqrt{x^{2}+y^{2}+(z-h_{j})^{2}},\quad \theta=\operatorname{atan}(y,x),\quad \phi=\phi_{j}, \tag{5}$$

where $h_{j}$ and $\phi_{j}$ refer to the $j$-th Velodyne sensor.
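A rough sketch of the adjusted conversion in Equation (5). It assumes the per-laser heights and pitch angles have already been estimated (the Hough Voting step is not shown) and that each point carries the index `j` of the laser that produced it; the excerpt does not say how that index is obtained, so `laser_ids` and the function name are hypothetical:

```python
import numpy as np

def corrected_spherical(points, laser_ids, heights, pitches):
    """Per-laser variant: subtract the laser height h_j, reuse its pitch phi_j."""
    points = np.asarray(points, dtype=float)
    laser_ids = np.asarray(laser_ids, dtype=int)   # assumed known per point
    x, y, z = points.T
    h = np.asarray(heights, dtype=float)[laser_ids]
    r = np.sqrt(x**2 + y**2 + (z - h)**2)          # range from the laser's own origin
    theta = np.arctan2(y, x)                       # yaw is unchanged
    phi = np.asarray(pitches, dtype=float)[laser_ids]  # pitch fixed per laser
    return r, theta, phi
```

Compared with the single-origin formula, only the range and pitch change: the pitch is looked up rather than computed, so points from one laser always fall in one image row.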

We then rasterize points $(r,\theta,\phi)$ into a 2D cylindrical projection $R(u,v)$ (a.k.a., range image) of size $H\times W$ with

$$u=\left(\left(\theta+\pi\right)/2\pi\right)W,\quad v=j, \tag{6}$$

where $(u,v)$ denotes the grid coordinate of a point in the range image. Thus, we obtain high-quality range images as illustrated at the bottom of Fig. 3. We denote the obtained range image as $\mathbf{x}\in\mathbb{R}^{H\times W\times 2}$, which comprises $H\times W$ pixels associated with both range and intensity $\{r,i\}$.
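The rasterization in Equation (6) could be sketched as follows. Several details are assumptions not stated in the excerpt: the column `u` is floored to an integer and clipped to `W - 1` (so that a yaw of exactly π stays inside the image), empty pixels are left at zero, and when several points land on the same pixel the nearest one is kept. The function name is hypothetical:

```python
import numpy as np

def rasterize_range_image(r, theta, laser_ids, intensity, height, width):
    """Build an (H, W, 2) image holding range and intensity per pixel."""
    r = np.asarray(r, dtype=float)
    theta = np.asarray(theta, dtype=float)
    intensity = np.asarray(intensity, dtype=float)
    v = np.asarray(laser_ids, dtype=int)               # row index: v = j
    u = np.floor((theta + np.pi) / (2.0 * np.pi) * width).astype(int)
    u = np.clip(u, 0, width - 1)                       # assumed edge handling

    # Assumed collision rule: visit points nearest-first and keep the first
    # point seen for each pixel.
    order = np.argsort(r)
    flat = v[order] * width + u[order]
    _, first = np.unique(flat, return_index=True)
    keep = order[first]

    image = np.zeros((height, width, 2), dtype=float)
    image[v[keep], u[keep], 0] = r[keep]
    image[v[keep], u[keep], 1] = intensity[keep]
    return image
```

For example, with `width=4`, a point at yaw 0 lands in column 2 and a point at yaw −π lands in column 0.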

github-actions[bot] commented 4 months ago

Hello @mjhoover1, thanks for the issue report! We are reviewing your report and will address it as soon as possible.