Filimoa / open-parse

Improved file parsing for LLMs
https://filimoa.github.io/open-parse/
MIT License
2.4k stars 91 forks

Table Extraction Tool #58

Open xyzdeclan opened 1 month ago

xyzdeclan commented 1 month ago

Description

Another tool for PDF table extraction was released recently. Maybe it could be an option to embed? https://github.com/ai8hyf/TF-ID

Filimoa commented 1 month ago

I will look into this. It'd be helpful if they published more benchmarks of their work. I'm also concerned about the relatively small amount of data it was trained on.

Edit: I now realize they just did a fine-tune.