domenicrosati / training-time-domain-authorization

0 stars 0 forks source link

A Benchmark for Training-time Domain Authorization

Project Goal: A Public Reusable and Replicable Toolkit for Rigerously Evaluating Training-time domain authorization solutions.

After evaluating a defence on this benchmark, an evaluator should be able to say: This defence provide training time domain authorization with a high-bar of empirical evidence.

Inspiration: BIER, GEM, DecodingTrust, HarmBench

Setup & Dev Stuff

$ curl -sSL https://install.python-poetry.org | python3 -
$ poetry install

The project uses Ruff with a Ruff pre-commit hook just for consistent styling.

Development workflow

Adding dependencies

Use poetry add for all dependencies

Project Structure

Experiment scripts are located in the experiments directory. The scripts are named according to the experiment they run. The scripts are written in bash and are used to run the experiments. The scripts are used to run the experiments and save the results in the results directory.

main.py is the main entry point into the project.

arguments.py contains all the arguments which should be a global variable to avoid code mess

Experiments

Add documentation on all experiments added here: