A Julia package for de-identifying CSV data sets.
using Pkg
Pkg.add("DeIdentification")
There are a few subtle points that must be kept in mind when using this package. See the documentation for more details.
All files must contain a primary identifier for the unit of analysis. E.g. if you are de-identifying patient data, all files must contain a patient ID.
The user must create a YAML config file.
The user must define the logging and output directories, and both must already exist before the package is run.
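The requirements above can be checked before a run. The following is a minimal sketch, not part of DeIdentification.jl: the helper names (`header_columns`, `files_missing_id`, `preflight`), the column name, and the directory paths are all hypothetical. It assumes plain comma-delimited files whose header row contains no quoted commas.

```julia
# Hypothetical pre-flight helpers; these are NOT provided by DeIdentification.jl.

# Read the header row of a comma-delimited file and return its column names.
# Assumption: the header has no quoted commas, so a naive split is enough.
function header_columns(path::AbstractString)
    header = open(readline, path)
    return [strip(col, ['"', ' ']) for col in split(header, ',')]
end

# Return the files whose header does not contain the primary identifier column.
function files_missing_id(paths, id_col::AbstractString)
    return [p for p in paths if !(id_col in header_columns(p))]
end

# Create the logging and output directories if they are missing, then verify
# that every input file carries the primary identifier.
function preflight(paths, id_col::AbstractString; log_dir, output_dir)
    mkpath(log_dir)      # no-op when the directory already exists
    mkpath(output_dir)
    missing_id = files_missing_id(paths, id_col)
    isempty(missing_id) ||
        error("Files missing column '$id_col': " * join(missing_id, ", "))
    return true
end
```

For example, `preflight(["patients.csv", "visits.csv"], "patient_id"; log_dir="logs", output_dir="output")` would create both directories and raise an error naming any file that lacks a `patient_id` column. The file names here are placeholders.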