unhcr / koboloadeR

Deprecated: please use {kobocruncher} - https://edouard-legoupil.github.io/kobocruncher/
https://unhcr.github.io/koboloadeR/docs/

Develop kobo_anonymise #5

Closed Edouard-Legoupil closed 5 years ago

Edouard-Legoupil commented 6 years ago

See some brainstorming on the expected scope for the function: https://github.com/unhcr/koboloadeR/blob/master/R/kobo_anonymise.R

Edouard-Legoupil commented 6 years ago

The initial scope of this function is basic anonymisation (i.e. removal of direct identifiers...) so that the dataset can be manipulated by approved researchers.
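
To make the "removal of direct identifiers" step concrete, here is a minimal base R sketch. It is not the actual kobo_anonymise code: the helper name drop_direct_identifiers, the column names and the example data are all made up for illustration, and it assumes the direct identifiers are already known by column name.

```r
# Hypothetical helper (not part of koboloadeR): drop direct identifiers by column name.
drop_direct_identifiers <- function(data, identifiers) {
  stopifnot(is.data.frame(data), is.character(identifiers))
  # Keep only the columns that are not listed as direct identifiers
  data[, !(names(data) %in% identifiers), drop = FALSE]
}

# Made-up example data, for illustration only
survey <- data.frame(
  name = c("A", "B", "C"),
  phone = c("111", "222", "333"),
  age = c(34, 27, 45),
  district = c("North", "South", "North"),
  stringsAsFactors = FALSE
)

# Remove the assumed direct identifiers and inspect what is left
anonymised <- drop_direct_identifiers(survey, c("name", "phone"))
names(anonymised)
```

Dropping columns only covers direct identifiers; the remaining variables (here age and district) can still act as quasi-identifiers in combination.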

To release the dataset more widely, links could be created with more advanced tools such as sdcMicro - http://www.ihsn.org/anonymization - and its Shiny app:

library(sdcMicro)
sdcApp()

Edouard-Legoupil commented 6 years ago

Need to integrate some functions from sdcMicro to assess k-anonymity and l-diversity, or to run the Special Uniques Detection Algorithm (SUDA), on the dataset. The idea would be to produce a data anonymisation analysis report that benchmarks the balance between disclosure risk and information loss. Users will then be able to apply further statistical disclosure control methods until they reach an acceptable anonymisation level.
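
As a rough illustration of the kind of risk measure such a report could start from, the sketch below computes k-anonymity in base R: each record is assigned the number of records sharing its combination of quasi-identifier values, and k is the smallest of those counts. This is not sdcMicro's implementation; the helper name k_anonymity, the key variables and the data are assumptions made up for the example, and missing values are not handled.

```r
# Hypothetical helper: k-anonymity over a set of quasi-identifiers (key variables).
k_anonymity <- function(data, key_vars) {
  stopifnot(is.data.frame(data), all(key_vars %in% names(data)))
  # One group label per record, built from its combination of key variable values
  groups <- interaction(data[key_vars], drop = TRUE)
  # Number of records sharing each record's combination
  sizes <- as.vector(table(groups)[as.character(groups)])
  list(
    k = min(sizes),             # the dataset is k-anonymous for this k
    n_unique = sum(sizes == 1), # records that are unique on the key variables
    group_size = sizes
  )
}

# Made-up example data, for illustration only
survey <- data.frame(
  age_group = c("18-29", "18-29", "30-44", "30-44", "45+"),
  district = c("North", "North", "South", "South", "North"),
  stringsAsFactors = FALSE
)

risk <- k_anonymity(survey, c("age_group", "district"))
risk$k
risk$n_unique
```

In this toy dataset the last record is the only one in its age group and district, so it is a sample unique and the dataset does not reach 2-anonymity. A report could list such records before any recoding or suppression is applied.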

Edouard-Legoupil commented 6 years ago

Work in progress here: https://github.com/unhcr/koboloadeR/blob/master/R/kobo_anonymisation_report.R