Open abrichr opened 1 year ago
For the `image` param, would it be called using the `image` property in `Screenshots`? I.e., if we have a `Screenshot` object, then `get_layout(screenshot.image())`?
Follow-up question: why do we wrap the image as an np array? Is it to leverage numpy's speed when moving the image around?
Edit: nvm I just looked at PIL and Image documentation :D
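On the call syntax asked about above: if `image` is defined as a Python property, it is read like an attribute, without parentheses. Here is a minimal sketch, assuming hypothetical stand-ins for `Screenshot` and `get_layout`. Neither is the project's actual code, and a nested list stands in for real image data.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass
class Screenshot:
    """Hypothetical stand-in for the project's Screenshot model."""

    pixels: List[List[int]]  # placeholder for real image data (assumption)

    @property
    def image(self) -> List[List[int]]:
        # A property is read as `screenshot.image`, with no call parentheses.
        return self.pixels


def get_layout(image: List[List[int]]) -> Dict[str, Any]:
    """Hypothetical stub; a real version would run a layout model on the image."""
    height = len(image)
    width = len(image[0]) if height else 0
    # No elements are detected here; only the image dimensions are reported.
    return {"width": width, "height": height, "elements": []}


screenshot = Screenshot(pixels=[[0, 0, 0], [255, 255, 255]])
layout = get_layout(screenshot.image)
```

In this sketch, `screenshot.image()` would try to call the returned value itself. With a list that raises a `TypeError`, so the property form `get_layout(screenshot.image)` is the one that works.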
In order to support https://github.com/MLDSAI/OpenAdapt/issues/157, we want to extract structured information from documents.
See https://huggingface.co/docs/transformers/model_doc/layoutlm for implementation (alternatives?)
We want a `LayoutExtractionReplayStrategyMixin` that implements: