apicrafter / metacrafter

Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully customizable and flexible rules
Apache License 2.0
43 stars 5 forks source link

Add identification of databases primary and foreign table identifiers #14

Open ivbeg opened 2 years ago

ivbeg commented 2 years ago

Some fields of databases are just incremental unique identifiers generated by the database engine. They can't be linked with any external identifier databases and are used only locally by databases.

There is a need to detect such fields as database-generated integer IDs. Common names of such IDs are id or <object name/table name>_id

The way to implement:

  1. Identify if the table field is a primary key and has an auto_increment feature
  2. Mark this field as database generated id key