capitalone / DataProfiler

What's in your data? Extract schema, statistics and entities from datasets
https://capitalone.github.io/DataProfiler
Apache License 2.0
1.42k stars 158 forks

feat: add null ratio to column stats #1051

Closed. suprabhatgurrala closed this 10 months ago

suprabhatgurrala commented 10 months ago

Null ratio is computed as a global stat, but not at the column level. I've made a simple change to compute null ratio as a column stat by dividing null_count by sample_size.
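For illustration only, the calculation described above could be sketched as follows. This is not the code from the PR: the function name `null_ratio`, the choice to return `None` when nothing was sampled, and the example column are assumptions made for this sketch, and it does not use any DataProfiler API.

```python
from typing import Optional


def null_ratio(null_count: int, sample_size: int) -> Optional[float]:
    """Return the fraction of sampled values in a column that are null.

    Returns None when sample_size is not positive, to avoid dividing by zero
    (an assumption of this sketch, not documented library behavior).
    """
    if sample_size <= 0:
        return None
    return null_count / sample_size


# Hypothetical column with two nulls among four sampled values.
column = [1, None, 3, None]
null_count = sum(value is None for value in column)
ratio = null_ratio(null_count, len(column))
```

Guarding against an empty sample keeps the stat well defined for columns where no rows were sampled; whether to report `None`, `0.0`, or NaN in that case is a design choice the description above does not specify.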

CLAassistant commented 10 months ago

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.

Suprabhat does not seem to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.

taylorfturner commented 10 months ago

Small change request: please branch off dev instead of main and open the PR into dev. Thanks @suprabhatgurrala for the contribution!

suprabhatgurrala commented 10 months ago

My fork only had the main branch. I will create a new PR that is branched off the dev branch.