diffix / explorer

Tool to automatically explore and generate stats on data anonymized using Diffix
MIT License
2 stars 1 forks source link

Daniel/numeric value synthesis #164

Closed dandanlen closed 4 years ago

dandanlen commented 4 years ago

This commit adds synthesis of numerical column data. It uses the statistical library Accord.NET to generate a distribution from the histogram buckets.

Other additions:

// Roughly the same, except no background thread is created token.ThrowIfCancellationRequested(); await MyTask();


- ~Added a basic cache to the ContextBuilder because the api calls to get the datasources were taking forever to parse. Now the context builder checks the cached tables (if present) and if there is a validation error it refreshes the cache and validates again.~ Actually this change caused issues with authentication, see commit 83149f49120a15129c2e223df9b61244be2b2fc1
- Added a badge to the README to indicate build status (I know it's kind of pointless but I was curious how to do this 🤷)
sebastian commented 4 years ago

Estimation of distribution type (not sure this is useful but I found it an interesting idea)

That's super neat!