awslabs / python-deequ

Python API for Deequ
Apache License 2.0
713 stars 134 forks source link

Replace test_data_quality_at_scale.ipynb #208

Closed komashk closed 1 month ago

komashk commented 3 months ago

Updated the dataset (amazon products reviews replaced with a synthetic data), added a couple of new examples

issue #207 issue #209

Description of changes:

Two updates:

  1. For test_data_quality_at_scale.ipynb: Updated the tutorial accompanying the blog post "Testing data quality at scale with PyDeequ". The blog has been recently updated and published.
  2. For the other ipynb tutorials (analyzers, profiles, repository, suggestions, verifications) updated S3 links, declaration of SPARK version before loading the library.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

review-notebook-app[bot] commented 3 months ago

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

chenliu0831 commented 1 month ago

Minor comment - I can approve when addressed.

komashk commented 1 month ago

Created a new pull request #230 to address the comments above.