tw4l / bulk-reviewer

DEPRECATED. Replaced with Electron desktop application: https://github.com/bulk-reviewer/bulk-reviewer
GNU Affero General Public License v3.0
13 stars 1 forks source link

Add option to ignore whitelisted URL/domain results #36

Closed tw4l closed 5 years ago

tw4l commented 5 years ago

e.g. ns.adobe.com (PDF), purl.org (Dublin Core), schemas.openxmlformats.org (OOXML)

tw4l commented 5 years ago

schemas.microsoft.org

tw4l commented 5 years ago

w3.org

tw4l commented 5 years ago

bulk_extractor stoplists for known/safe URLs, domains, email address, and CCNs added in commit https://github.com/timothyryanwalsh/bulk-reviewer/commit/aaa823a5f5957e688ece26a40d5d8e529c50205d