diffix / explorer

Tool to automatically explore and generate stats on data anonymized using Diffix
MIT License
2 stars 1 forks source link

Include datetime columns in correlation analysis. #360

Closed dandanlen closed 4 years ago

dandanlen commented 4 years ago

Simple datetime bucketing and synthesis for correlation analysis.

Uses the single column analysis to deduce a suitable bucket size (not too large, not too noisy). Generates a sample from the bucket snapped to the next-smaller time interval. (For example: bucket is an hour wide -> generate random number between 0 and 60 minutes and add these to the hour).