FLAIR-THU / VFLAIR

THU-AIR Vertical Federated Learning general, extensible and light-weight framework
68 stars 18 forks source link

VFLAIR

News

Basic Introduction

VFLAIR is a general, extensible and light-weight VFL framework that provides vanilla VFL training and evaluation process simulation alonging with several effective communication improvement methods as well as attack and defense evaluations considering data safety and privacy. Aside from NN serving as local models for VFL systems, tree-based VFL is also supported.

VFLAIR provides simulation of the vanilla VFL process containing forward local model prediction transmits, backward gradient transmits as well as local and global model updates.

Code Structure

Framework Structure

The VFLAIR framework is structured around four key functional modules:

Quick Start

Zero. Environment Preparation

  1. Download code files and install all the necessary requirements.

     # clone the repository
     $ git clone <link-to-our-github-repo>
    
     # install required packages
     $ conda create -n VFLAIR python=3.8
     $ conda activate VFLAIR
     $ pip install --upgrade pip
     $ cd VFLAIR
     $ pip install -r requirements.txt
    
     # install cuda related pytorch
     $ pip install torch==1.10.1+cu113 torchvision==0.11.2+cu113 torchaudio==0.10.1 -f https://download.pytorch.org/whl/cu113/torch_stable.html
    1. New datasets, except MNIST, CIAFR10 and CIAFR100, can be put under ../../share_dataset/ folder on your device.

One. Basic Benchmark Usage: A Quick Example

  1. Customize your own configurations.

    • Create a json file for your own evaluation configuration in /src/configs folder. Name it whatever you want, like my_configs.json.
    • /src/configs/basic_configs.json is a sample configuration file. You can copy it and modify the contents for your own purpose.
    • For detail information about configuration parameters, see /src/configs/README.md for detail information.
  2. Use cd src and python main_pipeline.py --gpu 0 --configs <your_config_file_name> to start the evaluation process. A quick example can be launched by simplying using cd src and python main_pipeline.py (a vanilla VFL training and testing process is launched). For more detail descriptions, see Section Two.

Two. Advanced Usage: Implement Your Own Algorithm

Contributing

We greatly appreciate any contribution to VFLAIR! Also, we'll continue to improve our framework and documentation to provide more flexible and convenient usage.

Please feel free to contact us if there's any problem with the code base or documentation!

Citation

If you are using VFLAIR for your work, please cite our paper with:

@article{zou2023vflair,
  title={VFLAIR: A Research Library and Benchmark for Vertical Federated Learning},
  author={Zou, Tianyuan and Gu, Zixuan and He, Yu and Takahashi, Hideaki and Liu, Yang and Ye, Guangnan and Zhang, Ya-Qin},
  journal={arXiv preprint arXiv:2310.09827},
  year={2023}
}