Closed lolipopshock closed 2 years ago
Layout Parser now comes with better support for various shape operations. Built on the generalized_connected_component_analysis_1d function, simple_line_detection detects text lines from token-level textblocks:
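    import layoutparser as lp

    # Load the token-level layout of the first page of the example PDF
    page_layout = lp.load_pdf("tests/fixtures/io/example.pdf")[0]

    # Group the page's tokens into text lines
    pdf_lines = lp.simple_line_detection(page_layout)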
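To illustrate the idea behind line detection via 1D connected component analysis, here is a minimal, library-independent sketch. It is not Layout Parser's actual implementation: the helper names `connected_components_1d` and `detect_lines`, the tuple-based token format, and the `max_gap` threshold are all assumptions made for illustration. Tokens are sorted by their vertical center, and consecutive tokens whose centers lie within `max_gap` of each other are merged into the same line.

```python
from typing import Callable, List, Sequence, Tuple, TypeVar

T = TypeVar("T")

# Hypothetical token format (not the library's TextBlock): (text, x_1, y_1, x_2, y_2)
Token = Tuple[str, float, float, float, float]


def connected_components_1d(
    items: Sequence[T], key: Callable[[T], float], max_gap: float
) -> List[List[T]]:
    """Group items whose 1D key values form chains with gaps <= max_gap."""
    groups: List[List[T]] = []
    for item in sorted(items, key=key):
        # Extend the current group if this item is close to the previous one
        if groups and key(item) - key(groups[-1][-1]) <= max_gap:
            groups[-1].append(item)
        else:
            groups.append([item])
    return groups


def detect_lines(tokens: Sequence[Token], max_gap: float = 5.0) -> List[List[Token]]:
    """Group tokens into lines by vertical center, then order each line left to right."""

    def y_center(token: Token) -> float:
        return (token[2] + token[4]) / 2

    lines = connected_components_1d(tokens, key=y_center, max_gap=max_gap)
    return [sorted(line, key=lambda token: token[1]) for line in lines]


# Made-up example tokens, deliberately out of reading order
tokens: List[Token] = [
    ("world", 60, 10, 100, 22),
    ("Hello", 10, 11, 55, 23),
    ("line", 70, 40, 100, 52),
    ("Second", 10, 41, 65, 53),
]

lines = detect_lines(tokens)
line_texts = [" ".join(token[0] for token in line) for line in lines]
print(line_texts)
```

In this sketch, a suitable `max_gap` depends on font size and line spacing; too large a value merges adjacent lines, too small a value splits tokens with slightly different baselines.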