gsireesh / ht-max

Code for the HT-MAX project
Apache License 2.0

Extract the abstract in the Reading Order Parser #28

Open gsireesh opened 5 months ago

gsireesh commented 5 months ago

Currently, the reading order parser skips the abstract because the VILA annotation already captures it well enough. That said, if we're using reading order sections for everything else, why wouldn't we also extract the abstract the same way?
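A rough sketch of what this could look like. It does not use the actual HT-MAX parser: the `Block` type, the label names, and `parse_sections` are all hypothetical stand-ins, and it assumes the blocks already arrive in reading order with a layout label attached.

```python
from dataclasses import dataclass


@dataclass
class Block:
    # Hypothetical layout label, e.g. "title", "abstract",
    # "section_header", or "paragraph". Not the project's real schema.
    label: str
    text: str


def parse_sections(blocks, include_abstract=True):
    """Group blocks (assumed to be in reading order) into {section name: text}."""
    sections = {}
    current = None  # name of the section currently being filled
    for block in blocks:
        if block.label == "abstract":
            # Previously this branch would just skip the block; here the
            # abstract is stored like any other section when requested.
            if include_abstract:
                sections.setdefault("Abstract", []).append(block.text)
            continue
        if block.label == "section_header":
            current = block.text
            sections.setdefault(current, [])
        elif block.label == "paragraph" and current is not None:
            sections[current].append(block.text)
    # Join the collected paragraphs of each section into one string.
    return {name: " ".join(parts) for name, parts in sections.items()}
```

Keeping the behavior behind a flag such as `include_abstract` would let the VILA-based abstract stay the default until the two outputs have been compared.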