tabulapdf / tabula-java

Extract tables from PDF files
MIT License
1.82k stars 425 forks source link

Where cells are merged in some rows but not others, I end up with too few columns in those rows. #525

Open annevjenkins42 opened 1 year ago

annevjenkins42 commented 1 year ago

I have an issue with merged cells where the cells are merged in some rows but not others. I end up getting one cell where 3 are merged and my columns no longer align in the rows where there are merged cells. It would be helpful if the code could treat a table where the merged cells correspond to multiple columns in the header row as multiple columns, either all with the content of the merged cells or with the first getting the content and the rest blank.