[feat] Add shape operation tools

Layout Parser now comes with better support for various shape operations. Built based on the generalized_connected_component_analysis_1d function, the simple_line_detection can be used for detecting text lines based on the token textblocks:

import layout parser as lp
page_layout = lp.load_pdf("tests/fixtures/io/example.pdf")[0]
pdf_lines = lp.simple_line_detection(page_layout)

Token Visualization	Text Line Visualization

Layout-Parser / layout-parser

[feat] Add shape operation tools #72