sdv-dev / SDV

Synthetic data generation for tabular data
https://docs.sdv.dev/sdv
Other
2.21k stars 287 forks source link

Custom Processing & Anonymization tutorial raises warning #2091

Closed amontanez24 closed 6 days ago

amontanez24 commented 1 week ago

Problem Description

The Custom Processing & Anonymization tutorial raises the following warning:

 Locales ['fr_CA'] do not support provider 'internet' and function 'email'.
In place of these locales, 'en_US' will be used instead. Please refer to the localized provider docs for more information https//faker.readthedocs.io/en/master/locales.html
  warnings.warn(
/opt/hostedtoolcache/Python/3.10.14/x64/lib/python3.10/site-packages/rdt/transformers/pii/anonymizer.py102 UserWarning Locales ['fr_FR', 'fr_CA'] do not support provider 'credit_card' and function 'credit_card_number'.
In place of these locales, 'en_US' will be used instead. Please refer to the localized provider docs for more information https//faker.readthedocs.io/en/master/locales.html
  warnings.warn(

This is because the locales are updated to use ['fr_FR', 'fr_CA'], but Faker can't generate credit cards or emails for those regions.

Expected behavior

amontanez24 commented 1 week ago

This actually might not be an issue anymore. The demo should be rerun to make sure before making any changes

srinify commented 6 days ago

This is now updated & fixed, thanks for catching this!