MinChen00 / UnlearningLeaks

Official implementation of "When Machine Unlearning Jeopardizes Privacy" (ACM CCS 2021)
GNU General Public License v3.0

The neg samples had exactly the same output on the original model and the unlearning model #8

Closed: Aicyanic closed this issue 6 months ago

Aicyanic commented 6 months ago

I was testing the DT model on Adult and found that the neg samples had exactly the same output on the original model and the unlearning model, which is why the difference attack works so well. But surely two different models should not produce exactly the same output for the same sample? Is there some detail I am overlooking?

This is the posterior difference dataset used for the attack model in the project. Many of the neg samples appear as duplicates, and their posteriors are exactly the same on both models. (Screenshot: DT_adult_test)
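
To put a number on the observation above, one could count how many sample pairs have identical posteriors and how many difference vectors occur more than once. This is only a minimal sketch: it assumes the posteriors are available as plain lists of class probabilities, and the function name `summarize_posterior_pairs` and the sample values are hypothetical, not taken from the repository.

```python
from collections import Counter


def summarize_posterior_pairs(original, unlearned, tol=0.0):
    """Count identical posterior pairs and duplicated difference vectors.

    original / unlearned: lists of per-class probability lists, one row per
    sample, in the same order. Hypothetical helper, not part of the project.
    """
    if len(original) != len(unlearned):
        raise ValueError("both posterior lists must have the same number of rows")

    identical = 0
    diffs = []
    for p_orig, p_unl in zip(original, unlearned):
        # Round so that tiny floating-point noise does not hide duplicates.
        diff = tuple(round(a - b, 12) for a, b in zip(p_orig, p_unl))
        diffs.append(diff)
        if all(abs(d) <= tol for d in diff):
            identical += 1

    counts = Counter(diffs)
    # Rows whose difference vector is shared with at least one other row.
    duplicated = sum(c for c in counts.values() if c > 1)
    return {"rows": len(diffs), "identical": identical, "duplicated": duplicated}


# Made-up posteriors for four neg samples, for illustration only.
neg_original = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.75, 0.25]]
neg_unlearned = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.5, 0.5]]

summary = summarize_posterior_pairs(neg_original, neg_unlearned)
print(summary)
```

A decision tree's posterior depends only on the leaf a sample reaches, so identical rows can occur whenever retraining leaves that leaf unchanged. Whether that explains the result here is an assumption that would still need to be verified.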