InQuest / ThreatIngestor

Extract and aggregate threat intelligence.
https://inquest.readthedocs.io/projects/threatingestor/
GNU General Public License v2.0
832 stars 135 forks source link

Add URL based allowlist filter on sources #148

Closed dspruell-i01 closed 1 year ago

dspruell-i01 commented 1 year ago

Feature request: implement capability to filter ingested pages from sources like blogs/sitemaps/etc. based on URL patterns.

Use case:

The result should be that all content from the source will be ingested, except resources matching the specified URL pattern. This should have precedence over other configuration options that might specify keywords to match for ingestion, for example.