pirate / sites-using-cloudflare

:broken_heart: Archived list of domains using Cloudflare DNS at the time of the CloudBleed announcement.
1.92k stars 320 forks source link

duplicate entries #94

Closed nickhilliard closed 7 years ago

nickhilliard commented 7 years ago
% cat sorted_unique_cf.txt | tr '[A-Z]' '[a-z]' | sort | uniq -d | wc -l
    1213
%

this should return zero.

nevinera commented 7 years ago

Yeah, he documented that he used | uniq | sort, which I thought was a typo. A similar (simpler) comparison:

$ cat sites-using-cloudflare-master/sorted_unique_cf.txt  | sort | uniq | wc -l
 4287625
$ cat sites-using-cloudflare-master/sorted_unique_cf.txt  | wc -l
 4288852
coderobe commented 7 years ago

I have opened a PR with a fixed version, see https://github.com/pirate/sites-using-cloudflare/pull/99

pirate commented 7 years ago

Thanks @coderobe, will merge that shortly and fix the README methodology.