Please read the paper here: https://arxiv.org/abs/2101.11693
Chosen as the best submission to ITU AI/ML in 5G Challenge for ITU-ML-5G-PS-022
While rich medical datasets are hosted in hospitals distributed across the world, concerns on patients' privacy is a barrier against using such data to train deep neural networks (DNNs) for medical diagnostics. We propose Dopamine, a system to train DNNs on distributed datasets, which employs federated learning (FL) with differentially-private stochastic gradient descent (DPSGD), and, in combination with secure aggregation, can establish a better trade-off between differential privacy (DP) guarantee and DNN's accuracy than other approaches. Results on a diabetic retinopathy~(DR) task show that Dopamine provides a DP guarantee close to the centralized training counterpart, while achieving a better classification accuracy than FL with parallel DP where DPSGD is applied without coordination.
report
: includes the final report. For 1.Design document showing the reasons for the choice of privacy-preserving technique and the network architectural components.
private_training
: includes the source code and a JupyterNotebook tutorial for training the privacy-preserving model explained in the report. For 2.Source code for the implementation of the privacy-preserving design across various architectural components.
private_inference
: includes the source code and demo for running inference on the privately trained model. For 3.Tested code and Test Report for all implementations- Implementations of Privacy-Preserving AI Technique, Trained Data Model, UI on smartphone.
video_demo
: include some video demos showing how to run training and inference. For 4. A Video of the demonstration of Proof-of-Concept.
We provided a Jupyter Notebook for training on Google Colab. Please see the file JNotebook_running_FSCDP_on_Colab.ipynb
in the private_training
folder.
Please use this link to get an inference on a Diabetic Retinopathy medical image:
https://imperial-diagnostics.herokuapp.com/
(Note: implementing the pure private inference is still in progress...)
If you find the provided code or the proposed algorithms useful, please cite this work as:
@article{malekzadeh2021dopamine,
title={Dopamine: Differentially Private Federated Learning on Medical Data},
author={Malekzadeh, Mohammad and Hasircioglu, Burak and Mital, Nitish and Katarya, Kunal and Ozfatura, Mehmet Emre and Gündüz, Deniz},
journal= {The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence (PPAI-21)},
year={2021},
url = {https://github.com/ipc-lab/private-ml-for-health}
}
We kindly welcome collaboration and/or contribution to this work. Please feel free to drop a line to us via email or by opening an issue.