:information_source: Welcome Mozilla Global Sprinters! :wave: :tada: :confetti_ball:
Please join our Gitter channel to start discussing the project! For more information on how to contribute, we encourage you to have a look at this README, and to check the issues.
Privacy policies are a legal requirement for websites handling users' data. So anyone should be able to access them, read them and understand what using a given service costs them in terms of privacy. Except no one reads them :see_no_evil:. The users' fault? Probably not. We can list several reasons:
In short, they usually aren't designed for people to read and understand. But still, the content of these policies is very important to anyone's privacy, for this is where you should learn what private data you agree to give away.
Privacy Bot is a project that aims to address these issues. If privacy policies are not meant for humans, then perhaps we can design a bot to automatically do the heavy lifting for us. The high-level goals of the project are to:
diffs on updates for free).
You can find the current privacy policies in the privacy_policies folder. In the future, we should probably host them on a separate branch to not mix the code and the data.
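Since the policies are stored as files, a diff between two versions of a policy shows what changed on an update. The following is a minimal sketch using Python's standard difflib module; the function name policy_diff and the sample texts are hypothetical and are not part of Privacy Bot itself.

```python
import difflib


def policy_diff(old_text, new_text, name="policy"):
    """Return a unified diff between two versions of a policy text.

    `policy_diff` is a hypothetical helper written for illustration;
    it is not a function provided by Privacy Bot.
    """
    diff_lines = difflib.unified_diff(
        old_text.splitlines(),
        new_text.splitlines(),
        fromfile=f"{name} (old)",
        tofile=f"{name} (new)",
        lineterm="",
    )
    return "\n".join(diff_lines)


if __name__ == "__main__":
    # Made-up sample texts, not taken from any real policy.
    old = "We collect your email address.\nWe never share your data."
    new = "We collect your email address.\nWe may share your data with partners."
    print(policy_diff(old, new))
```

Lines starting with - were removed from the old version and lines starting with + were added in the new one. Two identical texts produce an empty diff.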
To get going with the project as a contributor, it is recommended to install the package in 'developer mode' using pip, in a virtual environment. You also need Python 3.
$ pip install -e .
To analyze privacy policies, make sure you install the dependencies listed in requirements-analysis.txt.
$ pip install -r requirements-analysis.txt
To load all privacy policies, you can use the policies_snapshot_api; an example notebook is also available.
For another example analysis, have a look at the word relevance analysis.
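To give an idea of what a word relevance analysis can look like, here is a minimal sketch that scores words with TF-IDF using only the standard library. TF-IDF is an assumption chosen for illustration; the analysis referenced above may use a different technique. The function names and sample texts are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split a text into lowercase word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def tf_idf(documents):
    """Compute TF-IDF scores for a mapping of name -> policy text.

    Returns a mapping of name -> {word: score}. This is a hypothetical
    helper for illustration, not the method used by Privacy Bot.
    """
    tokens = {name: tokenize(text) for name, text in documents.items()}
    n_docs = len(tokens)

    # Number of documents in which each word appears at least once.
    doc_freq = Counter()
    for words in tokens.values():
        doc_freq.update(set(words))

    scores = {}
    for name, words in tokens.items():
        counts = Counter(words)
        total = len(words) or 1  # avoid dividing by zero on empty texts
        scores[name] = {
            word: (count / total) * math.log(n_docs / doc_freq[word])
            for word, count in counts.items()
        }
    return scores


if __name__ == "__main__":
    # Made-up sample texts, not taken from any real policy.
    policies = {
        "site-a": "We share data with partners",
        "site-b": "We store data securely",
    }
    for name, word_scores in tf_idf(policies).items():
        top = sorted(word_scores, key=word_scores.get, reverse=True)[:3]
        print(name, top)
```

Words that appear in every policy get a score of zero, so the highest-scoring words are the ones that set a given policy apart from the others.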
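There are two entry points, used respectively for finding candidate privacy policy URLs for a list of domains, and for fetching the policies themselves: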
$ find_policies --urls domains.txt
# Outputs: policy_url_candidates.json
$ fetch_policies policy_url_candidates.json
# Outputs: index.json and privacy_policies/
Keep in mind that the file formats are still a work in progress, and will likely evolve in the near future. Feel free to contribute with ideas and improvements!
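Because the file formats may change, it can help to inspect the generated JSON files without relying on a particular schema. The sketch below only reports the top-level shape of a JSON file; describe_json is a hypothetical helper, and nothing here assumes specific keys in policy_url_candidates.json or index.json.

```python
import json
import sys
from pathlib import Path


def describe_json(path):
    """Summarize the top-level structure of a JSON file.

    Hypothetical helper for illustration: it makes no assumption about
    the schema of the files produced by find_policies or fetch_policies.
    """
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    if isinstance(data, dict):
        return {"type": "object", "size": len(data), "sample": sorted(data)[:3]}
    if isinstance(data, list):
        return {"type": "array", "size": len(data), "sample": data[:3]}
    # Scalars (string, number, boolean, null) are reported as-is.
    return {"type": type(data).__name__, "size": 1, "sample": [data]}


if __name__ == "__main__":
    # Pass the files to inspect as arguments, for example:
    #   python describe_json.py policy_url_candidates.json index.json
    for arg in sys.argv[1:]:
        print(arg, describe_json(arg))
```

The summary shows whether a file holds an object or an array, how many entries it has, and a small sample of its keys or items.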
Thanks for your interest in contributing to Privacy Bot! There are many ways to contribute. To get started, take a look at CONTRIBUTING.md.
This project adheres to a code of conduct. By participating, you are expected to uphold this code. Please report unacceptable behavior to michel@cliqz.com.
TODO - move to issues tracker.
Join us at the Mozilla Global Sprint June 1-2, 2017! We'll be gathering in person at sites around the world and online to collaborate on this project and learn from each other. Get your #mozsprint tickets now!