Open Lotte-W opened 3 years ago
I don't think this can be done automatically, except for national ID numbers, which typically follow a certain scheme. We currently don't use any tool for this though.
It is very much a manual and vital process, that serves two purposes:
It is difficult to administrate and perform workflows on individual files for this purpose, consider therefore to administrate on data package level despite the less detail. This of course means some not-sensitive data will be restricted as part of a package with any sensitive data. This administration will make the manual workflow easier to manage.
How to automatically detect sensitive (private) information in born-digital documents/spreadsheets/email/.../metadata?