privacy-tech-lab / privacy-pioneer-web-crawler

Web crawler for detecting websites' data collection and sharing practices at scale using Privacy Pioneer
https://privacytechlab.org/
MIT License

Update crawl lists #36

Closed by dadak-dom 5 months ago

dadak-dom commented 5 months ago

As per our discussion, we should probably update the crawl lists to reflect the fact that lower-ranked, location-specific sites tend to provide less evidence. I'll play around to work out exactly what changes need to be made.

dadak-dom commented 5 months ago

The relevant PR has been merged, and the crawl lists have been updated. We decided to keep the total at 1000 sites per location to stay within our initial budget of 10k sites.
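
For anyone revisiting the list sizes later, the budget arithmetic in the comment above can be sketched roughly as follows. This is only an illustration: the function names and the `crawl_lists` dictionary layout are hypothetical and not taken from the repository; only the 1000-per-location and 10k figures come from this thread.

```python
# Figures taken from the comment above; everything else is a hypothetical sketch.
SITES_PER_LOCATION = 1000
TOTAL_BUDGET = 10_000


def max_locations(total_budget, sites_per_location):
    """Return how many locations fit in the budget at a fixed list size."""
    return total_budget // sites_per_location


def within_budget(crawl_lists, sites_per_location, total_budget):
    """Check that no list exceeds the per-location cap and the sum fits the budget.

    crawl_lists maps a location name to its list of site URLs (assumed layout).
    """
    if any(len(sites) > sites_per_location for sites in crawl_lists.values()):
        return False
    return sum(len(sites) for sites in crawl_lists.values()) <= total_budget


if __name__ == "__main__":
    print(max_locations(TOTAL_BUDGET, SITES_PER_LOCATION))
```

At 1000 sites per location, a 10k budget leaves room for at most 10 locations, so growing any one list would mean shrinking another or dropping a location.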