capitalone / DataProfiler

What's in your data? Extract schema, statistics and entities from datasets
https://capitalone.github.io/DataProfiler
Apache License 2.0
1.41k stars 157 forks source link

Bug fix for float precision calculation using categorical data with trailing 0s #1125

Closed SchadtJ closed 4 months ago

SchadtJ commented 5 months ago

Bug fix for https://github.com/capitalone/DataProfiler/issues/1048#issuecomment-1973429619.

The float precision calculation errors out for categorical data when one of the values has leading/ trailing zeros. This is due to the regex operation stripping these zeros and the resulting value being outside the list of possible values.

Passing tests: dataprofiler-bug

CLAassistant commented 5 months ago

CLA assistant check
All committers have signed the CLA.

taylorfturner commented 4 months ago

reverting this accidental merge -- @SchadtJ please reopen into dev. Thanks!