akvo / akvo-flow

A data collection and monitoring tool that works anywhere.
http://akvo.org/products/akvoflow/
GNU Affero General Public License v3.0
65 stars 31 forks source link

Mark a question as personal data #3576

Closed janagombitova closed 4 years ago

janagombitova commented 4 years ago

Context

As partners are more and more thinking about best data practices and trying to limit the amount of personal data they collect (due to GDPR) we see an increase in questions around How can I know which data columns (questions) hold personal data?. Also, many Terms of Reference for proposals hold requirements like: identification of personal data, distraction of personal data from the dataset.

Today there is no way to know which column holds personal data besides looking at the question text (column header) and the data itself. But often, already having to look at the data is something partners want to avoid, especially when data is being exported out of Flumen (as Flumen handles data access with good roles and permissions).

Why do we have this issue? What are we trying to solve? 

There is a lot of work that can be done to make it easier to work/not work with personal data. But our main goal is to allow users to work with Flumen without changing too much. We want to make it possible to identify personal data once the data is out of Flow via the API, so they can hide/mask/remove the data based on their needs.

Why not more?

Because we do not know what exactly the reasons are for hiding/masking such columns in the dataset. Will they analyse the data without these columns? Why collect them in the first place? Who should not see this data and why should this person see the rest of the dataset?

Goal

The goal of this change is to implement the minimum, so we can learn more and then build up on what we learn.

How will this benefit the users?

How will this benefit Akvo? 

Current status quo

Currently, there is no way to say this question will hold personal data.

Screen Shot 2020-05-25 at 10 56 45

Opportunity 

There are many opportunities underlying this issue:

But we will ONLY go for the minimum for now to learn.

The Idea = The minimum

Next steps 


muloem commented 4 years ago

@janagombitova Do you want to write up some tooltip helper text to show?

janagombitova commented 4 years ago

@muloem if it means that we also fix the bug around the tooltips, then yes :)

muloem commented 4 years ago

ok. put together the text and I will bring up that branch that I had fixing the tooltip 👍

tangrammer commented 4 years ago

Expose this flag in the API https://github.com/akvo/akvo-flow-api/pull/223

janagombitova commented 4 years ago

@muloem how about this Tooltip: Define that answers to this question are personal data (personally identifiable information - any data that can be used to identify a specific individual). When exporting the data via the Flow API, you can filter out the values based on this flag.

janagombitova commented 4 years ago

@marvinkome and @tangrammer is there more that needs to get done here? I am moving the tooltip to a separate issue (https://github.com/akvo/akvo-flow/issues/3593) that can be handled later. If besides that, all is done, let's release the change.

marvinkome commented 4 years ago

Nothing from the UI side

janagombitova commented 4 years ago

Released on June 2