awslabs / amazon-s3-find-and-forget

Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Apache License 2.0
238 stars 35 forks source link

XML Support #379

Open nonameplease opened 1 year ago

nonameplease commented 1 year ago

Hi team,

We are currently using AVRO, XML, and plain binary formats for the data landed in data lake in the raw zone. We are transforming all data stored in the curated zone as Parquet format. However, there are compliance requirements to also delete customer data in the raw zone. It would be awesome if this solution can be further enhanced to support additional formats.