This is a necessary step to resolving issue https://github.com/capitalone/DataProfiler/issues/820. Previously, _assimilate_histogram() called self to decide whether the given histogram contained integers or floats, and rounded the bins for histograms that only contained integers.
However, that rounding seems unnecessary. Here, we remove that rounding code entirely and modify the one test that fails, TestTextColumnProfiler.test_profile(). To make sure the test is still valid, here are its values:
NOTE: This is an alternative for PR https://github.com/capitalone/DataProfiler/pull/1071. If this is merged, then close PR https://github.com/capitalone/DataProfiler/pull/1071
Issue: https://github.com/capitalone/DataProfiler/issues/820
This is a necessary step to resolving issue https://github.com/capitalone/DataProfiler/issues/820. Previously,
_assimilate_histogram()
calledself
to decide whether the given histogram contained integers or floats, and rounded the bins for histograms that only contained integers.However, that rounding seems unnecessary. Here, we remove that rounding code entirely and modify the one test that fails,
TestTextColumnProfiler.test_profile()
. To make sure the test is still valid, here are its values:The data in the profile of that test is:
The old expected histogram is:
And the new expected histogram is: