EDIT: I just suddenly made progress, so I might not have needed to open this issue. Will close for now.
Original post:
I have some borderless tables where the rows have a very small separation, and the output sometimes has several rows in one.
But in those bunched cells the values are separated by "\n", so it seems like splitting them into separate rows would have been possible.
I'm not seeing any options in the API to tweak this. If there are any, could you let me know?
Or is there something I can change in the extract_tables or identify_borderless_tables functions to test with tables like this one?
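In the meantime, a post-processing workaround for the "\n" case might look roughly like the sketch below. It does not touch extract_tables or identify_borderless_tables at all; it only assumes the extracted table can be turned into a plain list of rows (lists of strings). The helper name split_bunched_rows is made up for illustration, and it assumes every cell in a bunched row lists its values top to bottom in the same order.

```python
from itertools import zip_longest


def split_bunched_rows(rows):
    """Expand rows whose cells hold newline-separated values into separate rows.

    Hypothetical helper, not part of any library: `rows` is a list of rows,
    each row a list of cell strings (or None for empty cells).
    """
    result = []
    for row in rows:
        # Split every cell on "\n"; None or empty cells become one empty string.
        parts = [(cell or "").split("\n") for cell in row]
        # Pad shorter cells with "" so each column yields the same number of sub-rows.
        for sub_row in zip_longest(*parts, fillvalue=""):
            result.append(list(sub_row))
    return result


rows = [["Item", "Qty"], ["Apple\nPear", "3\n5"], ["Plum", "7"]]
fixed = split_bunched_rows(rows)
print(fixed)
```

This only helps when the bunched cells line up one-to-one. If a cell legitimately contains a multi-line value, or one column is missing a value in the middle, the split rows would be misaligned.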
Here's the original image and a screenshot of the HTML output: