GoogleCloudPlatform / professional-services-data-validator

Utility to compare data between homogeneous or heterogeneous environments to ensure source and target tables match
Apache License 2.0
385 stars 108 forks source link

test: Add unicode character tests #1173

Closed nj1973 closed 2 weeks ago

nj1973 commented 3 weeks ago

Add row integrations tests for a wider set of characters than standard test data, e.g. data with accents.

For most SQL engines the test is a hash comparison.

For Teradata the hashing UDF does not accept unicode values and therefore that test is based on concat output.

I've skipped the MySQL test because unicode characters are being dropped:

FR LYXFJDER vs FÖR LYXFJÄDER
nj1973 commented 3 weeks ago

/gcbrun