gsireesh / ht-max

Code for the HT-MAX project
Apache License 2.0
0 stars 1 forks source link

Add extracted highlights to papermage doc #2

Closed gsireesh closed 7 months ago

gsireesh commented 8 months ago

Closing this issue will involve combining both @TianhaoHarryZhang's and @kamurphy11's scripts to extract the highlights from PDFs into a single function that follows our uniform function interface.

This should produce a list of PaperMage Entities that we can add to the document that's passed in.

TianhaoHarryZhang commented 8 months ago

Do we define a new class "Entity" or inherit from an available base class?