ibm-aur-nlp / PubTabNet

Rotated Text Headers #30

Open zdan12 opened 5 months ago

zdan12 commented 5 months ago

Hi, thank you for the awesome work!

I'm using this data to train my own model. The domain I'm interested in running inference on includes tables with rotated text in their headers, like this: https://css-tricks.com/rotated-table-column-headers/

I spot-checked many random examples in your dataset and didn't come across any table images with this feature. Do you know whether they're in the data and just rare, or have they been purposely excluded? If they aren't in there, I'd like to augment my training data with examples, which may need to be synthetic. Let me know.
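For illustration, one way the synthetic route might start is by generating HTML tables whose column headers are rotated with CSS, in the spirit of the linked css-tricks article. This is only a minimal sketch: the function name `rotated_header_table`, the class name `rot`, and the styling values are all made up for the example, and turning the page into a training image would still need a separate rendering step (for instance a headless browser), which is not shown.

```python
from html import escape


def rotated_header_table(headers, rows, angle_deg=-45):
    """Return a standalone HTML page with one table whose column headers are rotated.

    Hypothetical helper for illustration only; it is not part of PubTabNet.
    """
    css = (
        "table{border-collapse:collapse}"
        "td{border:1px solid #999;padding:4px 8px;text-align:right}"
        "th.rot{height:120px;white-space:nowrap;vertical-align:bottom}"
        # The inner <div> is what actually gets rotated.
        f"th.rot>div{{transform:rotate({angle_deg}deg);"
        "transform-origin:left bottom;width:2em}"
    )
    # Escape cell text so characters such as "&" or "<" stay valid HTML.
    head = "".join(
        f'<th class="rot"><div>{escape(h)}</div></th>' for h in headers
    )
    body = "".join(
        "<tr>" + "".join(f"<td>{escape(str(c))}</td>" for c in row) + "</tr>"
        for row in rows
    )
    return (
        f"<!DOCTYPE html><html><head><style>{css}</style></head><body>"
        f"<table><thead><tr>{head}</tr></thead><tbody>{body}</tbody></table>"
        "</body></html>"
    )


if __name__ == "__main__":
    page = rotated_header_table(["Revenue", "Costs"], [[10, 7], [12, 9]])
    with open("rotated_table.html", "w", encoding="utf-8") as f:
        f.write(page)
```

Varying `angle_deg` (for example -45, -60, -90) along with the header text and table size would give a range of rotated-header layouts to render.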

ajjimeno commented 5 months ago

Hi zdan12, the dataset contains many tables, and some might include what you are after, but I do not have a specific example. The related dataset FinTabNet (https://developer.ibm.com/exchanges/data/all/fintabnet/) may also be helpful for table processing.
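Since no specific example is known, a rough way to look for candidates might be to scan the annotation file for cells whose bounding box is much taller than it is wide. The sketch below assumes each annotation line is a JSON object with a `filename` and an `html.cells` list whose entries carry `tokens` and an optional `bbox` of `[x0, y0, x1, y1]`; the function names and thresholds are hypothetical. It is a heuristic only: narrow cells with wrapped text will also match, so hits would still need a visual check.

```python
import json


def looks_rotated(cell, min_chars=4, min_aspect=2.0):
    """Heuristic: a multi-character cell whose box is much taller than wide."""
    bbox = cell.get("bbox")
    if not bbox:
        return False  # empty cells have no box in the assumed format
    # Drop markup tokens such as "<b>" and keep the text characters.
    text = "".join(
        t for t in cell.get("tokens", [])
        if not (len(t) > 1 and t.startswith("<") and t.endswith(">"))
    )
    x0, y0, x1, y1 = bbox
    width, height = x1 - x0, y1 - y0
    if width <= 0 or len(text.strip()) < min_chars:
        return False
    return height / width >= min_aspect


def candidate_filenames(jsonl_lines, min_cells=2):
    """Yield filenames of tables with at least `min_cells` suspicious cells."""
    for line in jsonl_lines:
        sample = json.loads(line)
        cells = sample["html"]["cells"]
        if sum(looks_rotated(c) for c in cells) >= min_cells:
            yield sample["filename"]
```

Passing an open file handle for the annotation file as `jsonl_lines` would stream through it line by line without loading everything into memory.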
