In most cases, the actual loading of the document is done by parsel, not Scrapy.
form.py was 1 exception, but I have refactored it to use the response selector instead. The application of get_base_url in unified.py was a bug fix detected in the process.
Another exception is iterparse_lxml, which now uses resolve_entities=False as parsel does.
And then there was the sitemap code, which was already disabling entity resolution.
In most cases, the actual loading of the document is done by parsel, not Scrapy.
form.py
was 1 exception, but I have refactored it to use the response selector instead. The application ofget_base_url
inunified.py
was a bug fix detected in the process.Another exception is
iterparse_lxml
, which now usesresolve_entities=False
as parsel does.And then there was the sitemap code, which was already disabling entity resolution.