atlanhq / camelot

Camelot: PDF Table Extraction for Humans
https://camelot-py.readthedocs.io
Other
3.64k stars 354 forks source link

Generate table with spanning cells #302

Closed WindChaserZ closed 5 years ago

WindChaserZ commented 5 years ago

I am trying to generate an html table with spanning cells. After parsing out the table, each cell has rowspan and colspan information. However, after calling the pandas package and converting it into a DataFrame, the position and spanning information of each cell is lost. So the resulting form does not have spanning cells. Could you please add a function to generate a table with spanning cell?

vinayak-mehta commented 5 years ago

Can you share the PDF and the code you wrote that can help me reproduce this. copy_text kwarg (you can find out about it in the advanced docs) in a way preserve that spanning cell information.

vinayak-mehta commented 5 years ago

Closing due to inactivity.