domenicrosati / training-time-domain-authorization

0 stars 0 forks source link

Conceptual Analysis of Training Time Domain Authorization [Jan/David/Dom] #6

Open domenicrosati opened 3 weeks ago

domenicrosati commented 3 weeks ago

Issue

In order to ensure our benchmark fulfills the goals of measuring training-time domain authorization we need to understand what it is!

ToDo:

domenicrosati commented 1 week ago

Old Notes:

Not really a code task but this task involves starting an overleaf (Let's use the ICLR template 🤞 ) and really thinking through and formalizing https://docs.google.com/document/d/1YW26CdAKv06uc2CN09vf-w9RpkQbYU5yBMkaQBWkH84/edit (doesn't have to be very mathematical at first can be quite conceptual)

This is a lot like the analysis we developed in: https://arxiv.org/abs/2402.16382 but instead of necessary and suffecient conditions for a defence in ML security language we want the necessary and suffecient conditions for Domain Authorization including:

Outcome:

The beginning of our paper! A clear set of conditions which we will use to guide our benchmark construction. Ideally a reader reading this would say "Ok if a benchmark measured these things I would be convienced this domain authorization method works well and I would use it in industry to domain authorize my model"

Meta: Ideally someone other than Dom takes a first pass since Dom is too opinionated and has many gaps.