Closed wbcbugfree closed 1 week ago
We need to parse the PDF report to a machine-readable format to facilitate the Python script reading the file. Here are some existing tools:
A predictable problem is the splicing of the same paragraph across pages. A paragraph may be split into two paragraphs due to a page break, and the vast majority of current tools are unable to solve this problem, which may require manual inspection.
See report.
We need to parse the PDF report to a machine-readable format to facilitate the Python script reading the file. Here are some existing tools:
A predictable problem is the splicing of the same paragraph across pages. A paragraph may be split into two paragraphs due to a page break, and the vast majority of current tools are unable to solve this problem, which may require manual inspection.