allenai / mmda

multimodal document analysis
Apache License 2.0
159 stars 18 forks source link

Implementing heuristics for fixing Vila token predictions based on Layoutparser bounding box predictions #198

Closed egork520 closed 1 year ago