diffix / explorer

Tool to automatically explore and generate stats on data anonymized using Diffix
MIT License

Text value not capturing non-anonymized values #326

sebastian closed this issue 4 years ago

sebastian commented 4 years ago

Clinic dataset, prescriptions table, display_startdatetime column. The query result (query result link) contains plenty of non-anonymized values, but explorer captures none of them, so the sample values are synthesized.

sebastian commented 4 years ago

The same happens for the firstname column in the patients table. The query returns plenty of non-anonymized first names, but the explorer data shows only synthetic values instead of real ones. In fact, it gives no indication that any real names exist.

(query result link)

sebastian commented 4 years ago

Yet another example is the city column in the addresses table. The query returns lots of non-anonymized values, but none are provided in the returned data.
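
The behaviour expected in all three cases can be sketched roughly as follows. This is only an illustration, not explorer's actual code: the function name pick_sample_values, the (value, count) row shape, and the synthesize callback are hypothetical, and it assumes that suppressed values arrive as a single "*" bucket. The idea is to prefer real values whenever the result contains any, and to synthesize samples only when nothing survives anonymization.

```python
from typing import Callable, Iterable, List, Optional, Tuple

# Assumption: the anonymizer reports suppressed values as a single "*" bucket.
SUPPRESSED = "*"


def pick_sample_values(
    rows: Iterable[Tuple[Optional[str], int]],
    synthesize: Callable[[int], List[str]],
    limit: int = 5,
) -> List[str]:
    """Return real (non-suppressed) values if any exist, else synthetic ones.

    rows holds hypothetical (value, count) pairs from a grouped query.
    """
    # Keep only values that are neither NULL nor the suppressed bucket.
    real = [
        (value, count)
        for value, count in rows
        if value is not None and value != SUPPRESSED
    ]
    if real:
        # Most frequent values first, then cut to the requested sample size.
        real.sort(key=lambda pair: pair[1], reverse=True)
        return [value for value, _ in real[:limit]]
    # Nothing survived anonymization, so fall back to synthesized samples.
    return synthesize(limit)


if __name__ == "__main__":
    result = [("*", 120), ("Berlin", 40), (None, 3), ("Munich", 25)]
    print(pick_sample_values(result, lambda n: ["synthetic"] * n))
```

With a result like the one above, the samples would be the real city names, and the synthetic fallback would only be used if every row were suppressed or NULL.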

AndreiBozantan commented 4 years ago

Fixed with #329.