Closed G-Slient closed 4 months ago
Thank you for your contribution!
However I am afraid we cannot do it using this approach:
A more complete check would need to scan through all document pages and try to understand whether there are headers / footer at all e.g. by looking for content similarities equal / similar positions on page, etc.
This is quite a complex undertaking - actually belonging in the hands of some upstream AI ...
I think best is to introduce page margins as a parameter. E.g. margins=(left, top, right, bottom)
. With the option to also just specify margins=(top, bottom)
or margins=50
(apply to all 4 borders).
From these values, we would compute a clip rectangle and ignore every outside it.