Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
3.66k
stars
262
forks
source link
Extraction: move `max_tree_size` to config file #741
Closed
adbar closed 2 weeks ago
The
max_tree_size
parameter in extraction functions cannot be set on the CLI, it's better to adjust it through thesettings.cfg
file.