apicrafter / metacrafter

Metadata and data identification tool and Python library. Identifies PII, common identifiers, and language-specific identifiers. Fully customizable and flexible rules.
Apache License 2.0

Add XML support #9

Open ivbeg opened 2 years ago

ivbeg commented 2 years ago

Support XML files with the following list of tasks:

ivbeg commented 2 years ago

Current state:

Next steps: collect examples and write tests

ivbeg commented 2 years ago

Added automatic detection of XML tags
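
For readers unfamiliar with the idea, one way to detect the record tag of an XML file automatically can be sketched as below. This is not metacrafter's actual implementation, which is not shown in this thread; it is a minimal illustration that uses only the standard library. The function name `guess_record_tag`, the `max_events` limit and the "shallowest repeated tag" heuristic are all assumptions made for the example.

```python
import io
import xml.etree.ElementTree as ET
from collections import Counter


def guess_record_tag(source, max_events=10000):
    """Guess which tag marks a record (hypothetical helper, not metacrafter API).

    Heuristic (an assumption): the record tag is the shallowest tag that
    occurs more than once; ties are broken by the higher count.
    """
    counts = Counter()
    depth = 0
    events = ET.iterparse(source, events=("start", "end"))
    for i, (event, elem) in enumerate(events):
        if event == "start":
            counts[(depth, elem.tag)] += 1
            depth += 1
        else:
            depth -= 1
        if i >= max_events:
            break  # only sample the beginning of very large files
    # Sort key: smaller depth first, then larger count first.
    repeated = [(d, -n, tag) for (d, tag), n in counts.items() if n > 1]
    if not repeated:
        return None
    return min(repeated)[2]


sample = b"<catalog><meta/><item><name>a</name></item><item><name>b</name></item></catalog>"
print(guess_record_tag(io.BytesIO(sample)))
```

A real detector would probably also need to handle namespaces and files where records are wrapped in several container elements.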

ivbeg commented 5 months ago

Replaced the XML reader with pyiterable; it should now support huge XML files.
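
The reason a streaming reader copes with huge files is that it yields one record at a time instead of building the whole tree in memory. The pyiterable API is not shown in this thread, so the sketch below swaps in the standard library's `xml.etree.ElementTree.iterparse` to illustrate the same idea. The helper name `iter_records` is hypothetical, and the code assumes records are direct children of the root element with flat child fields.

```python
import io
import xml.etree.ElementTree as ET


def iter_records(source, record_tag):
    """Yield each record as a dict without loading the whole file.

    Hypothetical helper for illustration only; assumes every record is a
    direct child of the root and that its fields are simple child elements.
    """
    context = ET.iterparse(source, events=("start", "end"))
    _, root = next(context)  # the first event is the start of the root element
    for event, elem in context:
        if event == "end" and elem.tag == record_tag:
            yield {child.tag: (child.text or "") for child in elem}
            root.clear()  # drop already-processed records to keep memory flat


data = (
    b"<catalog>"
    b"<item><name>a</name><email>a@x.org</email></item>"
    b"<item><name>b</name><email>b@x.org</email></item>"
    b"</catalog>"
)
for record in iter_records(io.BytesIO(data), "item"):
    print(record)
```

Clearing the root after each record is what keeps memory use roughly constant; without it the parsed elements would still accumulate in the tree.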