In dhs-ncats/gatherer we do some cleaning up of the domains returned by this project's gather pipeline. (For example, we remove \r characters that appear in the middle of host names and we condense repeated . characters into a single one.) Does it make sense to do some of those things here instead?
In dhs-ncats/gatherer we do some cleaning up of the domains returned by this project's
gather
pipeline. (For example, we remove\r
characters that appear in the middle of host names and we condense repeated.
characters into a single one.) Does it make sense to do some of those things here instead?