PDF files are converted to DOCX and then tables are extracted from DOCX.
There are hidden columns and hidden text in the tables.
Is there a way to ignore the hidden columns and text during conversion?
Can the table structure be maintained during conversion of pdf to docx ignoring the hidden content and columns
Hi,
PDF files are converted to DOCX and then tables are extracted from DOCX. There are hidden columns and hidden text in the tables. Is there a way to ignore the hidden columns and text during conversion? Can the table structure be maintained during conversion of pdf to docx ignoring the hidden content and columns