allenai / papermage

library supporting NLP and CV research on scientific papers
https://papermage.org
Apache License 2.0
692 stars 54 forks source link

How to use doc.blocks #56

Open guoziting112 opened 10 months ago

guoziting112 commented 10 months ago

hello, Could you teach me how to use this function? please show me some examples ,thanks

kyleclo commented 7 months ago

Hi @guoziting112 , here's an example you can see in this pull request, lemme know if this helps: https://github.com/allenai/papermage/pull/73

the result from running the visualizer over blocks should be something like image

xsank commented 6 months ago

It seems that the block is only used to predict the figure and the table

kyleclo commented 6 months ago

@xsank hmm, it shouldn't be? blocks should be any visual rectilinear region on the page, so that would include figures/tables, but also things like captions and paragraph chunks.

xsank commented 5 months ago

@xsank hmm, it shouldn't be? blocks should be any visual rectilinear region on the page, so that would include figures/tables, but also things like captions and paragraph chunks.

Thank you for your reply, I have another question. The papermage only try to detect the table position, will it add the table extraction later?