Open Stebalien opened 4 months ago
@Stebalien thanks for opening this issue!
NOTE: I'm asking about this document. If I'm in the wrong repo, please point me in the right direction.
The document you mentioned does have a new repo for keeping track of the design and issues: https://github.com/patcg-individual-drafts/hybrid-proposal In the PATCG we refer to this as the "hybrid proposal" of several preceding proposals. It would be great if we could move the discussion there.
The issue you describe and set of tradeoffs is indeed an interesting challenge to address and similar to how you describe comes fundamentally from site fragmentation of the base DP-privacy guarantee. Section 2 and Section 5 in this document DP Budgeting for Hybrid Proposal discusses this issue at more length and explore more best effort methods to mitigate it. In your described scenario specifically, a cross-site rate limiter with a short time window would provide a line of defense against registering one event with many redirects as if it happened on many sites.
The issue in patcg-individual-drafts/ipa#57 for IPA with a matchkey provider was more pronounced and not easily mitigated. With the hybrid proposal, site fragmentation of the privacy guarantee still needs to be addressed but there are many more tools at your disposal since privacy budgeting is done on the device.
NOTE: I'm asking about this document. If I'm in the wrong repo, please point me in the right direction.
I'm grappling with what I think is a fundamental issue and trying to understand how IPA/PPA solves it. Specifically, the following tradeoff:
This is possibly related to https://github.com/patcg-individual-drafts/ipa/issues/57, but I'm wondering how that was solved in PPA. Specifically, I'm concerned about a party that can pretend to be a large number of:
Such a party could:
Given a sufficiently large $W$, no amount of noise/differential privacy can hide the fact that the signal for $M_i$ is stronger than the signal for $M_0$. What am I missing?