π We will change the license to Apache-2.0 in the next release.
Figure 1: An Example for FedAvg. You can create a scenario using generate_DATA.py
and run an algorithm using main.py
, clientNAME.py
, and serverNAME.py
. For a new algorithm, you only need to add new features in clientNAME.py
and serverNAME.py
.
π―We create a user-friendly algorithm library and evaluation platform for those new to federated learning. Join us in expanding the FL community by contributing your algorithms, datasets, and metrics to this project.
π―If you find our repository useful, please cite the following paper:
@article{zhang2023pfllib,
title={PFLlib: Personalized Federated Learning Algorithm Library},
author={Zhang, Jianqing and Liu, Yang and Hua, Yang and Wang, Hao and Song, Tao and Xue, Zhengui and Ma, Ruhui and Cao, Jian},
journal={arXiv preprint arXiv:2312.04992},
year={2023}
}
37 traditional FL (tFL) and personalized FL (pFL) algorithms, 3 scenarios, and 20 datasets.
Some experimental results are avalible in its paper and here.
Refer to this guide to learn how to use it.
This library can simulate scenarios using the 4-layer CNN on Cifar100 for 500 clients on one NVIDIA GeForce RTX 3090 GPU card with only 5.08GB GPU memory cost.
PFLlib primarily focuses on data (statistical) heterogeneity. For algorithms and an evaluation platform that address both data and model heterogeneity, please refer to our extended project Heterogeneous Federated Learning (HtFLlib).
As we strive to meet diverse user demands, frequent updates to the project may alter default settings and scenario creation codes, affecting experimental results.
Closed issues may help you a lot.
When submitting pull requests, please provide sufficient instructions and examples in the comment box.
The origin of the data heterogeneity phenomenon is the characteristics of users, who generate non-IID (not Independent and Identically Distributed) and unbalanced data. With data heterogeneity existing in the FL scenario, a myriad of approaches have been proposed to crack this hard nut. In contrast, the personalized FL (pFL) may take advantage of the statistically heterogeneous data to learn the personalized model for each user.
Thanks to @Stonesjtu, this library can also record the GPU memory usage for the model. Following FedCG, we also introduce the DLG (Deep Leakage from Gradients) attack and PSNR (Peak Signal-to-Noise Ratio) metric to evaluate the privacy-preserving ability of tFL/pFL algorithms (please refer to ./system/flcore/servers/serveravg.py
for example). Now we can train on some clients and evaluate performance on other new clients by setting args.num_new_clients
in ./system/main.py
. Note that not all the tFL/pFL algorithms support this feature.
Traditional FL (tFL)
FedAvg β Communication-Efficient Learning of Deep Networks from Decentralized Data AISTATS 2017
Update-correction-based tFL
SCAFFOLD - SCAFFOLD: Stochastic Controlled Averaging for Federated Learning ICML 2020
Regularization-based tFL
FedProx β Federated Optimization in Heterogeneous Networks MLsys 2020
FedDyn β Federated Learning Based on Dynamic Regularization ICLR 2021
Model-splitting-based tFL
MOON β Model-Contrastive Federated Learning CVPR 2021
FedLC β Federated Learning With Label Distribution Skew via Logits Calibration ICML 2022
Knowledge-distillation-based tFL
FedGen β Data-Free Knowledge Distillation for Heterogeneous Federated Learning ICML 2021
FedNTD β Preservation of the Global Knowledge by Not-True Distillation in Federated Learning NeurIPS 2022
Personalized FL (pFL)
FedMTL (not MOCHA) β Federated multi-task learning NeurIPS 2017
FedBN β FedBN: Federated Learning on non-IID Features via Local Batch Normalization ICLR 2021
Meta-learning-based pFL
Per-FedAvg β Personalized Federated Learning with Theoretical Guarantees: A Model-Agnostic Meta-Learning Approach NeurIPS 2020
Regularization-based pFL
pFedMe β Personalized Federated Learning with Moreau Envelopes NeurIPS 2020
Ditto β Ditto: Fair and robust federated learning through personalization ICML 2021
Personalized-aggregation-based pFL
APFL β Adaptive Personalized Federated Learning 2020
FedFomo β Personalized Federated Learning with First Order Model Optimization ICLR 2021
FedAMP β Personalized Cross-Silo Federated Learning on non-IID Data AAAI 2021
FedPHP β FedPHP: Federated Personalization with Inherited Private Models ECML PKDD 2021
APPLE β Adapt to Adaptation: Learning Personalization for Cross-Silo Federated Learning IJCAI 2022
FedALA β FedALA: Adaptive Local Aggregation for Personalized Federated Learning AAAI 2023
Model-splitting-based pFL
FedPer β Federated Learning with Personalization Layers 2019
LG-FedAvg β Think Locally, Act Globally: Federated Learning with Local and Global Representations 2020
FedRep β Exploiting Shared Representations for Personalized Federated Learning ICML 2021
FedRoD β On Bridging Generic and Personalized Federated Learning for Image Classification ICLR 2022
FedBABU β Fedbabu: Towards enhanced representation for federated image classification ICLR 2022
FedGC β Federated Learning for Face Recognition with Gradient Correction AAAI 2022
FedCP β FedCP: Separating Feature Information for Personalized Federated Learning via Conditional Policy KDD 2023
GPFL β GPFL: Simultaneously Learning Generic and Personalized Feature Information for Personalized Federated Learning ICCV 2023
FedGH β FedGH: Heterogeneous Federated Learning with Generalized Global Header ACM MM 2023
FedDBE β Eliminating Domain Bias for Federated Learning in Representation Space NeurIPS 2023
FedCAC β Bold but Cautious: Unlocking the Potential of Personalized Federated Learning through Cautiously Aggressive Collaboration ICCV 2023
PFL-DA β Personalized Federated Learning via Domain Adaptation with an Application to Distributed 3D Printing Technometrics 2023
Knowledge-distillation-based pFL (more in HtFLlib)
FedDistill (FD) β Communication-Efficient On-Device Machine Learning: Federated Distillation and Augmentation under Non-IID Private Data 2018
FML β Federated Mutual Learning 2020
FedKD β Communication-efficient federated learning via knowledge distillation Nature Communications 2022
FedProto β FedProto: Federated Prototype Learning across Heterogeneous Clients AAAI 2022
FedPCL (w/o pre-trained models) β Federated learning from pre-trained models: A contrastive learning approach NeurIPS 2022
FedPAC β Personalized Federated Learning with Feature Alignment and Classifier Collaboration ICLR 2023
For the label skew scenario, we introduce 14 famous datasets: MNIST, EMNIST, Fashion-MNIST, Cifar10, Cifar100, AG News, Sogou News, Tiny-ImageNet, Country211, Flowers102, GTSRB, Shakespeare, and Stanford Cars, they can be easy split into IID and non-IID version. Since some codes for generating datasets such as splitting are the same for all datasets, we move these codes into ./dataset/utils/dataset_utils.py
. In the non-IID scenario, 2 situations exist. The first one is the pathological non-IID scenario, the second one is the practical non-IID scenario. In the pathological non-IID scenario, for example, the data on each client only contains the specific number of labels (maybe only 2 labels), though the data on all clients contains 10 labels such as the MNIST dataset. In the practical non-IID scenario, Dirichlet distribution is utilized (please refer to this paper for details). We can input balance
for the iid scenario, where the data are uniformly distributed.
For the feature shift scenario, we use 3 datasets that are widely used in Domain Adaptation: Amazon Review (fetch raw data from this site), Digit5 (fetch raw data from this site), and DomainNet.
For the real-world scenario, we also introduce 3 naturally separated datasets: Omniglot (20 clients, 50 labels), HAR (Human Activity Recognition) (30 clients, 6 labels), PAMAP2 (9 clients, 12 labels). For the details of datasets and FL algorithms in IoT, please refer to FL-IoT.
If you need another data set, just write another code to download it and then use the utils.
cd ./dataset
# python generate_MNIST.py iid - - # for iid and unbalanced scenario
# python generate_MNIST.py iid balance - # for iid and balanced scenario
# python generate_MNIST.py noniid - pat # for pathological noniid and unbalanced scenario
python generate_MNIST.py noniid - dir # for practical noniid and unbalanced scenario
# python generate_MNIST.py noniid - exdir # for Extended Dirichlet strategy
The output of python generate_MNIST.py noniid - dir
Number of classes: 10
Client 0 Size of data: 2630 Labels: [0 1 4 5 7 8 9]
Samples of labels: [(0, 140), (1, 890), (4, 1), (5, 319), (7, 29), (8, 1067), (9, 184)]
--------------------------------------------------
Client 1 Size of data: 499 Labels: [0 2 5 6 8 9]
Samples of labels: [(0, 5), (2, 27), (5, 19), (6, 335), (8, 6), (9, 107)]
--------------------------------------------------
Client 2 Size of data: 1630 Labels: [0 3 6 9]
Samples of labels: [(0, 3), (3, 143), (6, 1461), (9, 23)]
--------------------------------------------------
for MNIST and Fashion-MNIST
for Cifar10, Cifar100 and Tiny-ImageNet
for AG_News and Sogou_News
for AmazonReview
for Omniglot
for HAR and PAMAP
Install CUDA v11.6.
Install conda latest and activate conda.
conda env create -f env_cuda_latest.yaml # You may need to downgrade the torch using pip to match the CUDA version
Download this project to an appropriate location using git.
git clone https://github.com/TsingZ0/PFLlib.git
Create proper environments (see Environments).
Build evaluation scenarios (see Datasets and scenarios (updating)).
Run evaluation:
cd ./system
python main.py -data MNIST -m CNN -algo FedAvg -gr 2000 -did 0 # using the MNIST dataset, the FedAvg algorithm, and the 4-layer CNN model
Note: It is preferable to tune algorithm-specific hyper-parameters before using any algorithm on a new machine.
If you need to simulate FL under practical situations, which includes client dropout, slow trainers, slow senders, and network TTL, you can set the following parameters to realize it.
-cdr
: The dropout rate for total clients. The selected clients will randomly drop at each training round.-tsr
and -ssr
: The rates for slow trainers and slow senders among all clients. Once a client is selected as a "slow trainer"/"slow sender", for example, it will always train/send slower than the original one. -tth
: The threshold for network TTL (ms). It is easy to add new algorithms and datasets to this library.
To add a new dataset into this library, all you need to do is write the download code and use the utils which is similar to ./dataset/generate_MNIST.py
(you can also consider it as the template).
To add a new algorithm, you can utilize the class Server and class Client, which are wrote in ./system/flcore/servers/serverbase.py
and ./system/flcore/clients/clientbase.py
, respectively.
To add a new model, just add it into ./system/flcore/trainmodel/models.py
.
If you have a new optimizer while training, please add it into ./system/flcore/optimizers/fedoptimizer.py
The evaluation platform is also convenient for users to build a new platform for specific applications, such as our FL-IoT and HtFLlib.
If you are interested in the experimental results (e.g., the accuracy) of the above algorithms, you can find some results in our accepted FL papers (i.e., FedALA, FedCP, GPFL, and DBE) listed as follows that also use this library. Please note that this developing project may not be able to reproduce the results on these papers, since some basic settings may change due to the requests of the community. For example, we previously set shuffle=False
in clientbase.py
@inproceedings{zhang2023fedala,
title={Fedala: Adaptive local aggregation for personalized federated learning},
author={Zhang, Jianqing and Hua, Yang and Wang, Hao and Song, Tao and Xue, Zhengui and Ma, Ruhui and Guan, Haibing},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
volume={37},
number={9},
pages={11237--11244},
year={2023}
}
@inproceedings{Zhang2023fedcp,
author = {Zhang, Jianqing and Hua, Yang and Wang, Hao and Song, Tao and Xue, Zhengui and Ma, Ruhui and Guan, Haibing},
title = {FedCP: Separating Feature Information for Personalized Federated Learning via Conditional Policy},
year = {2023},
booktitle = {Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining}
}
@inproceedings{zhang2023gpfl,
title={GPFL: Simultaneously Learning Global and Personalized Feature Information for Personalized Federated Learning},
author={Zhang, Jianqing and Hua, Yang and Wang, Hao and Song, Tao and Xue, Zhengui and Ma, Ruhui and Cao, Jian and Guan, Haibing},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
pages={5041--5051},
year={2023}
}
@inproceedings{zhang2023eliminating,
title={Eliminating Domain Bias for Federated Learning in Representation Space},
author={Jianqing Zhang and Yang Hua and Jian Cao and Hao Wang and Tao Song and Zhengui XUE and Ruhui Ma and Haibing Guan},
booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
year={2023},
url={https://openreview.net/forum?id=nO5i1XdUS0}
}