The function pd_extras.anonymize_database.anonymize_data splits a DataFrame containing private information into two DataFrames: one holding only the private information and one holding all the other data. The two resulting DataFrames are linked to each other through an ID_OWNER column.
The values of this column are created using nonces: a random prefix and a random suffix are added to each string containing all the private information, and the resulting strings are then hashed with SHA-256.
Because the prefix and suffix are random, it can be useful to fix the random seed so that the generated IDs are reproducible.
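
The scheme described above can be sketched with the standard library alone. This is only an illustration of the idea, not the actual code of anonymize_data: the helper names (make_nonce, make_owner_id, split_rows), the seed parameter, the nonce length, and the "|" separator are all assumptions, and a list of dicts stands in for the DataFrame.

```python
import hashlib
import random
import string

ALPHABET = string.ascii_letters + string.digits


def make_nonce(rng, length=16):
    """Return a random alphanumeric string drawn from the given generator."""
    return "".join(rng.choice(ALPHABET) for _ in range(length))


def make_owner_id(private_values, rng):
    """Hash the private values, wrapped in a random prefix and suffix, with SHA-256."""
    joined = "|".join(str(v) for v in private_values)  # separator is an assumption
    salted = make_nonce(rng) + joined + make_nonce(rng)
    return hashlib.sha256(salted.encode("utf-8")).hexdigest()


def split_rows(rows, private_keys, seed=None):
    """Split each row into a private part and a public part sharing an ID_OWNER."""
    rng = random.Random(seed)  # a fixed seed makes the nonces reproducible
    private_rows, other_rows = [], []
    for row in rows:
        owner_id = make_owner_id([row[k] for k in private_keys], rng)
        private_rows.append(
            {"ID_OWNER": owner_id, **{k: row[k] for k in private_keys}}
        )
        other_rows.append(
            {"ID_OWNER": owner_id,
             **{k: v for k, v in row.items() if k not in private_keys}}
        )
    return private_rows, other_rows


rows = [
    {"name": "Ada", "email": "ada@example.org", "score": 3},
    {"name": "Bob", "email": "bob@example.org", "score": 5},
]
private_rows, other_rows = split_rows(rows, ["name", "email"], seed=42)
```

With the same seed, two calls on the same input produce identical ID_OWNER values, which is convenient in tests. A seeded generator also makes the nonces predictable, so a fixed seed is probably best kept out of production runs.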