Both on the CLI and with Python the spider component stores and retrieves URLs which are possibly out of scope if the input URL is restricted to a portion of a domain, e.g. https://www.example.org/news/en/.
This behavior should be further investigated, tested and/or improved.
Both on the CLI and with Python the spider component stores and retrieves URLs which are possibly out of scope if the input URL is restricted to a portion of a domain, e.g.
https://www.example.org/news/en/
.This behavior should be further investigated, tested and/or improved.