For now, we can use the following steps to generate the JSON files:
Use https://croppdf.com/ to remove all unnecessary white spaces from the PDF document.
Utilize https://products.aspose.app/pdf/table-extraction to create an Excel file directly from the PDF. This is because converting it directly to CSV may result in a poor-quality output. Creating an XLS file first and then converting it will yield a better result.
Review and edit the document to eliminate unnecessary white spaces or inconsistencies.
The goal is to create an automatic pipeline that performs these steps.
JSON Parser
For now, we can use the following steps to generate the JSON files:
Use https://croppdf.com/ to remove all unnecessary white spaces from the PDF document.
Utilize https://products.aspose.app/pdf/table-extraction to create an Excel file directly from the PDF. This is because converting it directly to CSV may result in a poor-quality output. Creating an XLS file first and then converting it will yield a better result.
Review and edit the document to eliminate unnecessary white spaces or inconsistencies.
The goal is to create an automatic pipeline that performs these steps.