InQuest / ThreatIngestor

Extract and aggregate threat intelligence.
https://inquest.readthedocs.io/projects/threatingestor/
GNU General Public License v2.0

Improves Sitemap Filtering #130

Closed by battleoverflow 1 year ago

battleoverflow commented 1 year ago

Features

config.yml example:

sources:
  # Searches for the "articles" keyword
  - name: inquest-sitemap-articles
    module: sitemap
    url: https://www.inquest.net/sitemap.xml
    filter: articles

  # No filter given: defaults to the "blog" keyword
  - name: inquest-sitemap-blog
    module: sitemap
    url: https://www.inquest.net/sitemap.xml

  # Searches for the "articles" or "security" keywords
  - name: inquest-sitemap-blog-articles-security
    module: sitemap
    url: https://www.inquest.net/sitemap.xml
    filter: articles|security

  # Restricts results to a directory with "path"
  - name: inquest-sitemap-blog-category
    module: sitemap
    url: https://www.inquest.net/sitemap.xml
    path: /blog/category/

  # Combines "path" with "filter"
  # Only returns results under /blog/category/release or /blog/category/solutions
  - name: inquest-sitemap-release-solutions
    module: sitemap
    url: https://www.inquest.net/sitemap.xml
    path: /blog/category/
    filter: release|solutions
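
To make the intended semantics of the config above concrete, here is a minimal Python sketch of how `filter` and `path` could be applied to each URL found in a sitemap. It is not the code from this PR: `matches_sitemap_filter` is a hypothetical helper, and it assumes that `filter` is treated as a regular expression, that `path` is a prefix of the URL path, and that when both are set the filter is applied to whatever follows the path.

```python
import re
from urllib.parse import urlparse


def matches_sitemap_filter(url, filter_pattern=None, path=None):
    """Hypothetical helper: decide whether a sitemap URL should be kept."""
    url_path = urlparse(url).path

    if path is not None:
        # Assumption: `path` is a directory prefix of the URL path.
        if not url_path.startswith(path):
            return False
        if filter_pattern is None:
            return True
        # Assumption: with both keys set, `filter` applies to what follows `path`.
        return re.search(filter_pattern, url_path[len(path):]) is not None

    # Assumption: without `path`, a missing `filter` falls back to "blog",
    # as the config comment describes.
    return re.search(filter_pattern or "blog", url_path) is not None
```

Under these assumptions, a made-up URL such as https://www.inquest.net/blog/category/release/example would be kept by the last source in the config, while https://www.inquest.net/blog/category/news/example would be dropped.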