OSINT-TECHNOLOGIES / dpulse

DPULSE - Tool for complex approach to domain OSINT
https://pypi.org/project/dpulse/
MIT License
82 stars 4 forks source link

Add FileNotFoundError handler in robots/sitemap extraction function #45

Closed OSINT-TECHNOLOGIES closed 5 months ago

OSINT-TECHNOLOGIES commented 5 months ago

sitemap_links_status = np.extract_links_from_sitemap(sitemap_links_filepath, sitemap_filepath, 'pdf') [data_assembler]

which then calls:

tree = ET.parse(sitemap_path) [networking processor, extract_links_from_sitemap function]

may cause FileNotFoundError and stop scan entirely