GoogleCloudPlatform / dlp-dataflow-deidentification

Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP
Apache License 2.0

Updating the limitations on using complex file formats #185

Closed chitara-01 closed 10 months ago

chitara-01 commented 10 months ago

Summary (Short summary of what is being done) :

Updating the limitations on using complex file formats

Description (Describe in detail the fix made) :

Recent debugging and research on the AVRO/Parquet/JSONL pipelines revealed some limitations in the existing implementation. This pull request updates the README guide to make users aware of those limitations.

Bug ID (if any) :

b/310711896

Public Documentation (if any) :


TESTED (Test Cases with scenario and description - must have 1 positive and 1 negative scenario) :

Not required.

codecov[bot] commented 10 months ago

Codecov Report

All modified and coverable lines are covered by tests :white_check_mark:

Comparison is base (bd9f225) 13.41% compared to head (d208832) 13.41%.

Additional details and impacted files

```diff
@@           Coverage Diff            @@
##             master     #185   +/-   ##
=========================================
  Coverage     13.41%   13.41%
  Complexity       67       67
=========================================
  Files            53       53
  Lines          2519     2519
  Branches        213      213
=========================================
  Hits            338      338
  Misses         2161     2161
  Partials         20       20
```
