Closed janagombitova closed 4 years ago
@janagombitova Do you want to write up some tooltip helper text to show?
@muloem if it means that we also fix the bug around the tooltips, then yes :)
ok. put together the text and I will bring up that branch that I had fixing the tooltip 👍
Expose this flag in the API https://github.com/akvo/akvo-flow-api/pull/223
@muloem how about this Tooltip: Define that answers to this question are personal data (personally identifiable information - any data that can be used to identify a specific individual). When exporting the data via the Flow API, you can filter out the values based on this flag.
@marvinkome and @tangrammer is there more that needs to get done here? I am moving the tooltip to a separate issue (https://github.com/akvo/akvo-flow/issues/3593) that can be handled later. If besides that, all is done, let's release the change.
Nothing from the UI side
Released on June 2
Context
As partners are more and more thinking about best data practices and trying to limit the amount of personal data they collect (due to GDPR) we see an increase in questions around How can I know which data columns (questions) hold personal data?. Also, many Terms of Reference for proposals hold requirements like: identification of personal data, distraction of personal data from the dataset.
Today there is no way to know which column holds personal data besides looking at the question text (column header) and the data itself. But often, already having to look at the data is something partners want to avoid, especially when data is being exported out of Flumen (as Flumen handles data access with good roles and permissions).
Why do we have this issue? What are we trying to solve?
There is a lot of work that can be done to make it easier to work/not work with personal data. But our main goal is to allow users to work with Flumen without changing too much. We want to make it possible to identify personal data once the data is out of Flow via the API, so they can hide/mask/remove the data based on their needs.
Why not more?
Because we do not know what exactly the reasons are for hiding/masking such columns in the dataset. Will they analyse the data without these columns? Why collect them in the first place? Who should not see this data and why should this person see the rest of the dataset?
Goal
The goal of this change is to implement the minimum, so we can learn more and then build up on what we learn.
How will this benefit the users?
How will this benefit Akvo?
Current status quo
Currently, there is no way to say this question will hold personal data.
Opportunity
There are many opportunities underlying this issue:
But we will ONLY go for the minimum for now to learn.
The Idea = The minimum
personal data
Next steps