Support for starting path property for Crawler

Description Crawler currently always begins crawling from the root of the domain specified in the domain configuration property. Sometimes it is useful to begin crawling a site from a sub-page/path. The crawler would start with that page so that pages linked from there would appear at the top of the list of URLs

Proposed solution Provide a configuration property e.g. starting_path that allows someone to specify a path from which to begin crawling, rather than always starting to crawl from the / root page.

salsadigitalauorg / merlin-framework

Support for starting path property for Crawler #83