Open meritor opened 5 years ago
Can you upload the PDF here so we can use it to reproduce the error? It must be free to use, because we will use it in the test suite.
Furthermore, could you move your raw text output to a Gist, because its hard to get an overview with this much text. Thank you.
Hi,
I have this PDF exported from Microsoft Excel that does not work well when that's been read by pdfparser. I did bit more investigation and found that the Tj with same line positions are not always in same sequence. e.g The line is:
Number | text
123456789 | Approved
223456789 | Refused
323456789 | Approved
423456789 | Refused
When above text is read, its getting as: Approved Refused Approved Refused 423456789
123456789
223456789
323456789
However, the Tm position looked looked correct.. so.. in short all we need to do is read the Tm first and then arrange the text?
109.64 676.96 m 109.64 34.62 l S 109.58 34.56 0.96 642.46 re f*
198.83 676.96 m 198.83 34.62 l S 198.77 34.56 0.95999 642.46 re f*
29.34 677.92 m 199.67 677.92 l S 29.28 677.02 170.45 0.96001 re f*
29.34 665.56 m 199.67 665.56 l S 29.28 664.66 170.45 0.96001 re f*
29.34 653.2 m 199.67 653.2 l S 29.28 652.3 170.45 0.96001 re f*
29.34 640.84 m 199.67 640.84 l S 29.28 639.94 170.45 0.96001 re f*
29.34 628.45 m 199.67 628.45 l S 29.28 627.55 170.45 0.95999 re f*
29.34 616.09 m 199.67 616.09 l S 29.28 615.19 170.45 0.95999 re f*
29.34 603.73 m 199.67 603.73 l S 29.28 602.83 170.45 0.95999 re f*
29.34 591.37 m 199.67 591.37 l S 29.28 590.47 170.45 0.95999 re f*
29.34 579.01 m 199.67 579.01 l S 29.28 578.11 170.45 0.96002 re f*
29.34 566.65 m 199.67 566.65 l S 29.28 565.75 170.45 0.95999 re f*
29.34 554.29 m 199.67 554.29 l S 29.28 553.39 170.45 0.96002 re f*
29.34 541.93 m 199.67 541.93 l S 29.28 541.03 170.45 0.95999 re f*
29.34 529.57 m 199.67 529.57 l S 29.28 528.67 170.45 0.96002 re f*
29.34 517.21 m 199.67 517.21 l S 29.28 516.31 170.45 0.95999 re f*
29.34 504.85 m 199.67 504.85 l S 29.28 503.95 170.45 0.96002 re f*
29.34 492.49 m 199.67 492.49 l S 29.28 491.59 170.45 0.95999 re f*
29.34 480.13 m 199.67 480.13 l S 29.28 479.23 170.45 0.95999 re f*
29.34 467.77 m 199.67 467.77 l S 29.28 466.87 170.45 0.95999 re f*
29.34 455.41 m 199.67 455.41 l S 29.28 454.51 170.45 0.95999 re f*
29.34 443.05 m 199.67 443.05 l S 29.28 442.15 170.45 0.95999 re f*
29.34 430.67 m 199.67 430.67 l S 29.28 429.77 170.45 0.95999 re f*
29.34 418.31 m 199.67 418.31 l S 29.28 417.41 170.45 0.95999 re f*
29.34 405.95 m 199.67 405.95 l S 29.28 405.05 170.45 0.95999 re f*
29.34 393.59 m 199.67 393.59 l S 29.28 392.69 170.45 0.95999 re f*
29.34 381.23 m 199.67 381.23 l S 29.28 380.33 170.45 0.95999 re f*
29.34 368.87 m 199.67 368.87 l S 29.28 367.97 170.45 0.95999 re f*
29.34 356.51 m 199.67 356.51 l S 29.28 355.61 170.45 0.96002 re f*
29.34 344.15 m 199.67 344.15 l S 29.28 343.25 170.45 0.95999 re f*
29.34 331.79 m 199.67 331.79 l S 29.28 330.89 170.45 0.96002 re f*
29.34 319.43 m 199.67 319.43 l S 29.28 318.53 170.45 0.96002 re f*
29.34 307.07 m 199.67 307.07 l S 29.28 306.17 170.45 0.96002 re f*
29.34 294.71 m 199.67 294.71 l S 29.28 293.81 170.45 0.96002 re f*
29.34 282.35 m 199.67 282.35 l S 29.28 281.45 170.45 0.95996 re f*
29.34 269.99 m 199.67 269.99 l S 29.28 269.09 170.45 0.96002 re f*
29.34 257.63 m 199.67 257.63 l S 29.28 256.73 170.45 0.96002 re f*
29.34 245.24 m 199.67 245.24 l S 29.28 244.34 170.45 0.95996 re f*
29.34 232.88 m 199.67 232.88 l S 29.28 231.98 170.45 0.96002 re f*
29.34 220.52 m 199.67 220.52 l S 29.28 219.62 170.45 0.96002 re f*
29.34 208.16 m 199.67 208.16 l S 29.28 207.26 170.45 0.96002 re f*
29.34 195.8 m 199.67 195.8 l S 29.28 194.9 170.45 0.95996 re f*
29.34 183.44 m 199.67 183.44 l S 29.28 182.54 170.45 0.95996 re f*
29.34 171.08 m 199.67 171.08 l S 29.28 170.18 170.45 0.96002 re f*
29.34 158.72 m 199.67 158.72 l S 29.28 157.82 170.45 0.96002 re f*
29.34 146.36 m 199.67 146.36 l S 29.28 145.46 170.45 0.95996 re f*
29.34 134 m 199.67 134 l S 29.28 133.1 170.45 0.95996 re f*
29.34 121.64 m 199.67 121.64 l S 29.28 120.74 170.45 0.96002 re f*
29.34 109.28 m 199.67 109.28 l S 29.28 108.38 170.45 0.96002 re f*
29.34 96.924 m 199.67 96.924 l S 29.28 96.024 170.45 0.95996 re f*
29.34 84.564 m 199.67 84.564 l S 29.28 83.664 170.45 0.95996 re f*
29.34 72.204 m 199.67 72.204 l S 29.28 71.304 170.45 0.96002 re f*
29.34 59.82 m 199.67 59.82 l S 29.28 58.92 170.45 0.96002 re f*
29.34 47.46 m 199.67 47.46 l S 29.28 46.56 170.45 0.95996 re f* Q