google / rappor

RAPPOR: Privacy-Preserving Reporting Algorithms
Apache License 2.0
859 stars 165 forks source link

fast_em.cc optimization potential #53

Open andychu opened 8 years ago

andychu commented 8 years ago

Instead of serializing the m x n x N cond_prob matrix to C++, we can serialize the m and n dimension separately (without the outer product).

We could either do the outer product up front in C++, or we could do it "lazily" on every EM step. This is more computation, but could actually speed things up because we would save a lot in memory bandwidth.