world-federation-of-advertisers / cardinality_estimation_evaluation_framework

Evaluation framework and methods for estimating cardinalities of groups of sets
Apache License 2.0
22 stars 9 forks source link

Increase universe size. #66

Closed huangxichen1 closed 4 years ago

huangxichen1 commented 4 years ago

This PR increases the universe size from 100k to 1m for the complete evaluation.

The rationale is that the sketch tested are designed for the US internet population which is far larger than 100k. The small universe, and hence the small set size, will lead to a very sparse sketch, which causes us trouble in the local differential private scenario. That is, the signal (IDs added to the sketch) is relatively small compared to the noise. The number of estimable sets of ADBF under the small universe will be close to 0, which is misleading.