AMP-SCZ / dpanonymize

Personally identifiable information remover
Apache License 2.0
0 stars 0 forks source link

dpanonymize

dpanonymize is a PII removal tool for given data types. It's mainly designed to work with lochness on a PHOENIX structured data, but it also has functionalities to take separate file and folder with predefined datatype.

Installation

Just use pip:

pip install dpanonymize

For most recent DPACC-lochness:

pip install git+https://github.com/AMP-SCZ/dpanonymize

Debugging

cd ~
git clone https://github.com/AMP-SCZ/dpanonymize
cd dpanonymize
pip install .

If you do not have lochness installed already:

pip install git+https://github.com/AMP-SCZ/lochness

Tests

cd dpanonymize/tests
./dpanonymize_test.sh

Running

or you can also select which datatype to apply PII removal

dpanon.py --phoenix_root /path/to/PHOENIX --datatype actigraphy dpanon.py --phoenix_root /path/to/PHOENIX --datatype surveys


- Execute PII removal on a single file

dpanon.py \ --in_file /path/to/surveys/file \ --out_file /path/to/PII_removed/file \ --datatype surveys


- Execute PII removal on a directory where there are multiple files of same data type

  This applies PII removal on all files under the given directory.

dpanon.py \ --in_dir /path/to/surveys/directory \ --out_dir /path/to/PII_removed/directory \ --datatype surveys



## Documentation

TBD