privacy-tech-lab / privacy-pioneer-web-crawler

Web crawler for detecting websites' data collection and sharing practices at scale using Privacy Pioneer
https://privacytechlab.org/
MIT License

Update readme #22

Closed by SebastianZimmeck 3 months ago

SebastianZimmeck commented 3 months ago

Currently, the readme seems outdated as we decided to use the cloud crawler. @dadak-dom, can you add the cloud crawl instructions and remove everything outdated? @danielgoldelman can help with any questions.

dadak-dom commented 3 months ago

Readme has been updated 👍

SebastianZimmeck commented 3 months ago

There are a number of points that are not clear to me or that otherwise need to be fixed (e.g., formatting). I do not think anybody outside of our team can look at this and set this up or even understand what is going on. @dadak-dom, take a look at other readmes for examples, e.g., Privacy Pioneer or the GPC Web Crawler. Here are the main points:

Please take another pass, @dadak-dom, and spend some time to make the readme good. If there is anything that you do not understand, clarify it or remove it. If there is anything that is outdated, it should be removed. Imagine you know nothing about Privacy Pioneer and you just found this code because you searched "data collection and sharing open source code" or something like that, and now you are trying to make sense of it and set it up. What do you need to know?

dadak-dom commented 3 months ago

Got it, will work on fixes. Thank you for the feedback @SebastianZimmeck 👍

SebastianZimmeck commented 3 months ago

Great! Thank you, @dadak-dom!

SebastianZimmeck commented 3 months ago

@natelevinson10 will also check if the updated readme is understandable.

SebastianZimmeck commented 3 months ago

Another point for the readme.