Code repository for the paper "RAGent: Retrieval-based Access Control Policy Generation". RAGent is an access control policy generation framework developed using language models and large language models to identify access control requirements written in natural language (NLACPs), generate structured access control policies from them, and verify the generated policies, iteratively refining any incorrect policy up to n times until the verification result indicates it is correct. After n rounds, if the verifier still finds the policy incorrect, the policy is sent back to the administrator for manual refinement before it is applied to the authorization system. By developing this framework, we improve the reliability of the automated policy generation process and, in turn, reduce data breaches due to access control failures in the future.
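For illustration, the verify-and-refine loop works roughly as follows. This is a minimal sketch; generate, verify, refine, and max_rounds are hypothetical placeholders, not the actual RAGent interfaces.

```python
# Minimal sketch of the verify-and-refine loop described above.
# generate, verify, and refine are hypothetical placeholders, not RAGent's APIs.
from typing import Callable, Tuple


def generate_with_refinement(
    nlacp: str,
    generate: Callable[[str], str],
    verify: Callable[[str, str], Tuple[bool, str]],
    refine: Callable[[str, str, str], str],
    max_rounds: int = 3,
) -> Tuple[str, str]:
    policy = generate(nlacp)                          # initial generation by the LLM
    for _ in range(max_rounds):
        is_correct, feedback = verify(nlacp, policy)  # verifier checks the policy
        if is_correct:
            return policy, "verified"
        policy = refine(nlacp, policy, feedback)      # refine using the verifier's feedback
    # After max_rounds unsuccessful refinements, defer to the administrator.
    return policy, "needs_manual_refinement"
```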
The provided demo with HotCRP privacy policies can be run in the terminal using the following commands.
$ git clone https://github.com/accessframework/RAGent.git
$ pip install gdown
$ gdown --folder https://drive.google.com/drive/folders/1-kcQZEEU0ZMcH7PakNSF-87YiqY9A8Xx
$ cd RAGent/demo
$ ./demo.sh
The results, including the final generations and, if refinement is involved, the intermediate generations, will be saved in the demo directory.
Clone the repository
$ git clone https://github.com/accessframework/RAGent.git
$ cd RAGent/
(Recommended) Create a new python virtual environment
$ python3 -m venv .venv
$ source .venv/bin/activate
Install the dependencies
$ pip install -r requirements.txt
NOTE: All parts of the framework were tested only on an Ubuntu machine with an NVIDIA A100-SXM4-80GB GPU and 1007 GB of memory.
Download the checkpoints necessary to reproduce the results.
$ gdown --folder https://drive.google.com/drive/folders/1-kcQZEEU0ZMcH7PakNSF-87YiqY9A8Xx
The easiest way to reproduce the results reported in the paper is to run the trained models on the prepared datasets used in the paper. To this end, we will explain how to run the models for each step of the framework in the following sections.
To reproduce the results for the NLACP identification step, run the following commands with <mode> = [collected|ibm|t2p|acre|cyber|overall]
$ cd identification/
$ python evaluate_classification.py --mode=<mode>
Options:
$ python evaluate_classification.py --help
Usage: evaluate_classification.py [OPTIONS]
Evaluates RAGent's NLACP identification module.
Options:
--mode [t2p|acre|ibm|collected|cyber|overall]
Mode of training (document-fold you want to evaluate the trained model on) [default: t2p; required]
We have provided the vectorstores needed for policy generation in data/vectorstores.
When evaluating the proposed framework's ability to generate structured representations from the identified NLACPs, we follow two main approaches, as mentioned in the paper:
Access control policy component extraction: Evaluating its ability to extract policy components from NLACPs
Access control rule generation: Evaluating its ability to generate ACRs directly from NLACPs (NOTE: each ACR should contain its own access decision, which is not considered in existing frameworks, as well as its subject, action, resource, purpose, and condition); an illustrative example is shown below.
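For illustration, an ACR generated from an NLACP might look like the following. This is a hypothetical example; the exact serialization RAGent uses may differ.

```python
# Hypothetical example of a structured ACR; the exact format RAGent uses may differ.
nlacp = "Doctors can read patient records for treatment purposes if the patient consents."

acrs = [
    {
        "decision": "allow",                 # access decision
        "subject": "doctor",
        "action": "read",
        "resource": "patient record",
        "purpose": "treatment",
        "condition": "the patient consents",
    }
]
```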
To reproduce the results under the SAR setting as reported in the paper, either load the model from the downloaded checkpoints and evaluate it on the dataset using the following commands with <mode> = [collected|ibm|t2p|acre|cyber]
$ cd generation/evaluation/
$ python eval_ragentv_sar.py --mode=<mode>
Options:
$ python eval_ragentv_sar.py --help
Usage: eval_ragentv_sar.py [OPTIONS]
Evaluates RAGent in SAR setting.
Options:
--mode [t2p|acre|ibm|collected|cyber|overall]
Mode of training (document-fold you want to
evaluate the trained model on) [default:
ibm; required]
--result_dir TEXT Directory to save evaluation results
[default: results/eval/sar/]
--use_pipe Whether or not to use the transformers
pipeline for generation
--help Show this message and exit.
Or use the pre-saved entities for each document-fold to generate the comparison between our framework and the related frameworks by running the following command.
$ python generate_comparison.py
To reproduce the results reported under the DSARCP setting (i.e., component extraction) and ACR generation from NLACPs by loading the checkpoints, run the following command with <mode> = [collected|ibm|t2p|acre|cyber|overall].
$ cd generation/evaluation/
$ python eval_ragentv.py --mode=<mode> --refine
Options:
$ python eval_ragentv.py --help
Usage: eval_ragentv.py [OPTIONS]
Evaluates RAGent in DSARCP setting.
Options:
--mode [t2p|acre|ibm|collected|cyber|overall]
Mode of training (document-fold you want to
evaluate the trained model on) [default:
ibm; required]
--result_dir TEXT Directory to save evaluation results
[default: results/sarcp/]
--n INTEGER Number of entities to retrieve per each
component [default: 5]
--refine Whether to conduct the verification and
iterative refinement
--no_retrieve Whether to retrieve information
--no_update Whether to conduct the post-processing with
retrieved information
--use_pipe Whether or not to use the transformers
pipeline for generation
--help Show this message and exit.
Running the above command produces two F1 scores: one showing the ability to extract components and the other showing the ability to generate ACRs.
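For reference, a component-level F1 score could be computed along the following lines. This is a minimal sketch assuming exact-match comparison of (component type, value) pairs; the actual evaluation scripts may use a different matching scheme.

```python
# Minimal sketch of a component-level F1 computation, assuming exact-match
# comparison of (component type, value) pairs; the actual evaluation scripts
# may use a different matching scheme.
def f1(predicted: set, gold: set) -> float:
    tp = len(predicted & gold)
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(gold) if gold else 0.0
    return 0.0 if precision + recall == 0 else 2 * precision * recall / (precision + recall)


predicted = {("subject", "doctor"), ("action", "read"), ("resource", "record")}
gold = {("subject", "doctor"), ("action", "read"), ("resource", "patient record")}
print(f1(predicted, gold))  # 0.666...
```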
To train and evaluate the BART-based verifier on the train, validation, and test sets, run the following (an illustrative sketch of the underlying setup is shown after the options).
$ cd verification/
$ python train_test_verifier_single_split.py
Options:
$ python train_test_verifier_single_split.py --help
Usage: train_test_verifier_single_split.py [OPTIONS]
Trains the access control policy verifier using a single random train, val,
test split
Options:
--dataset_path TEXT Directory of the generated verification datasets
[default: ../data/verification; required]
--train_epochs INTEGER Number of epochs to train [default: 10]
--learning_rate FLOAT Learning rate [default: 2e-05]
--batch_size INTEGER Batch size [default: 8]
--out_dir TEXT Output directory [default:
../checkpoints/verification]
--help Show this message and exit.
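As a rough illustration of what such a verifier involves, the sketch below fine-tunes a BART sequence classifier over (NLACP, serialized policy) pairs. This is an assumption-laden example: the labels, data format, and training details of train_test_verifier_single_split.py may differ.

```python
# Illustrative sketch only: a BART classifier over (NLACP, serialized policy)
# pairs. The actual verifier training script and data format may differ.
import torch
from transformers import (BartForSequenceClassification, BartTokenizer,
                          Trainer, TrainingArguments)

tokenizer = BartTokenizer.from_pretrained("facebook/bart-large")
model = BartForSequenceClassification.from_pretrained("facebook/bart-large", num_labels=2)

# Each example pairs an NLACP with a serialized candidate policy and a label
# (here, 1 = correct policy, 0 = incorrect policy).
examples = [
    ("Doctors can read patient records.", "allow; doctor; read; patient record", 1),
    ("Doctors can read patient records.", "deny; nurse; delete; patient record", 0),
]


class VerificationDataset(torch.utils.data.Dataset):
    def __len__(self):
        return len(examples)

    def __getitem__(self, idx):
        nlacp, policy, label = examples[idx]
        enc = tokenizer(nlacp, policy, truncation=True, padding="max_length",
                        max_length=256, return_tensors="pt")
        item = {k: v.squeeze(0) for k, v in enc.items()}
        item["labels"] = torch.tensor(label)
        return item


args = TrainingArguments(output_dir="../checkpoints/verification",
                         num_train_epochs=10, learning_rate=2e-5,
                         per_device_train_batch_size=8)
Trainer(model=model, args=args, train_dataset=VerificationDataset()).train()
```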
To reproduce the results reported in the paper, run the following command.
$ python eval_test.py
The final test results can be found in verification/results/final_test_result.txt
Next we will see how to train each component of our proposed framework with your own data.
NOTE: We used an Ubuntu machine with an NVIDIA A100-SXM4-80GB GPU and 1007 GB of memory to train each module of the framework. The training datasets will be released upon request.
To fine-tune BERT with your own data to identify NLACPs, run the following commands (an illustrative sketch of the underlying fine-tuning setup is shown after the options),
$ cd identification/
$ python train_classifier.py [OPTIONS]
with options,
Usage: train_classifier.py [OPTIONS]
Trains the NLACP identification module
Options:
--dataset_path TEXT Location of the dataset to train the model
[required]
--max_len INTEGER Maximum length for the input sequence
--batch_size INTEGER Batch size [required]
--epochs INTEGER Number of epochs [required]
--learning_rate FLOAT Learning rate [required]
--out_dir TEXT Directory to save the checkpoints [required]
--help Show this message and exit.
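For orientation, fine-tuning BERT for NLACP identification amounts to binary sentence classification, roughly as sketched below. This is a simplified example under assumed defaults; the actual train_classifier.py, its dataset format, and hyperparameters may differ.

```python
# Illustrative sketch: binary sentence classification with BERT
# (1 = the sentence is an NLACP, 0 = it is not). The actual training
# script and dataset format may differ.
import torch
from transformers import (BertForSequenceClassification, BertTokenizerFast,
                          Trainer, TrainingArguments)

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

sentences = ["Nurses can view lab results.", "The hospital was founded in 1990."]
labels = [1, 0]
enc = tokenizer(sentences, truncation=True, padding=True, max_length=128)


class NLACPDataset(torch.utils.data.Dataset):
    def __len__(self):
        return len(labels)

    def __getitem__(self, idx):
        item = {k: torch.tensor(v[idx]) for k, v in enc.items()}
        item["labels"] = torch.tensor(labels[idx])
        return item


args = TrainingArguments(output_dir="checkpoints/identification",
                         num_train_epochs=3, learning_rate=2e-5,
                         per_device_train_batch_size=8)
Trainer(model=model, args=args, train_dataset=NLACPDataset()).train()
```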
To fine-tune the LLaMa 3 8B Instruct LLM with Parameter-Efficient Fine-Tuning (PEFT) for access control policy generation, run the following commands (see the illustrative sketch after the options),
$ cd generation/
$ python train_generator.py [OPTIONS]
with options,
Usage: train_generator.py [OPTIONS]
Training the LLaMa 3 8B for generating access control policies from NLACPs
Options:
--train_path TEXT Huggingface dataset name [required]
--out_dir TEXT Output directory [default: ../checkpoints/]
--batch_size INTEGER Batch size [default: 8]
--lr FLOAT Learning rate [default: 0.0002]
--seed INTEGER Random seed [default: 1]
--help Show this message and exit.
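For context, Parameter-Efficient Fine-Tuning with LoRA adapters looks roughly like the following sketch; train_generator.py may use a different configuration (e.g., quantization, a chat prompt template, or trl's SFTTrainer), so treat this only as an outline.

```python
# Illustrative LoRA (PEFT) setup for a LLaMa 3 8B Instruct generator;
# the actual train_generator.py configuration may differ.
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Meta-Llama-3-8B-Instruct"  # gated model; requires access
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)

# Attach low-rank adapters so only a small fraction of the weights is trained.
lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()

# Training then proceeds with a standard causal-LM objective over
# (NLACP, structured policy) training pairs, e.g. via transformers.Trainer or trl.
```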
As mentioned in the verifier inference section, the verifier can be trained and tested using the following command.
$ cd verification/
$ python train_test_verifier_single_split.py
After training all the components using your own datasets, they can be evaluated as described in the Inference section.