privacy-tech-lab / gpc-web-crawler

Web crawler for detecting websites' compliance with GPC privacy preference signals at scale
https://privacytechlab.org/
MIT License
4 stars 2 forks source link

Clean up Google Drive files #111

Closed SebastianZimmeck closed 3 months ago

SebastianZimmeck commented 4 months ago

Reminder for me.

franciscawijaya commented 3 months ago

I will be adding documentation for the files in Google Drive (ie. the 8-batches and other Misc. files). There are also some files that I might need to confirm. I could then either put it in the ReadME or wiki (to be determined).

franciscawijaya commented 3 months ago

I have updated the wiki and finished writing my documentation for the google drive. Unless there is anything else that needs to be done on the google drive directly, I think this issue can be closed.

SebastianZimmeck commented 3 months ago

Thanks, @franciscawijaya!

Can you start with the file and directory names and make them visually more clear as, for example, in the OptMeowt repo readme? That would make it a bit easier to grasp. Essentially, you have the content, but the formatting makes it a bit difficult to follow.

franciscawijaya commented 3 months ago

Done! I've cleaned up the formatting.

SebastianZimmeck commented 3 months ago

Thanks, @franciscawijaya!

I made a few smaller modifications.

Can you add descriptions of the Crawl_Data and sites_with_GPP files?

franciscawijaya commented 3 months ago

Got it! I've added descriptions for both Crawl_Data and sites_with_GPP.

SebastianZimmeck commented 3 months ago

Looks good!