ushaham / BatchEffectRemoval

55 stars 17 forks source link

Calibration

repository for the paper "Removal of Batch Effects using Distribution-Matching Residual Networks" by Uri Shaham, Kelly P. Stanton, Jun Zhao, Huamin Li, Khadir Raddassi, Ruth Montgomery, and Yuval Kluger.

The script Train_MMD_ResNet.py is the main demo script, and can be generally used for calibration experiments. It was used to train all MMD ResNets used for the CyTOF experiments reported in our manuscript. It loads two CyTOF datasets, corresponding to measurements of blood of the same person on the same machine in two different days, and denoises them. The script then trains a MMD-ResNet using one of the datasets as source and the other as target, to remove the batch effects.

The CyTOF datasets used to produce the results in the manuscript are saved in Data. The labels for the CyTOF datasets (personDay) were used only to separate the CD8 population during evaluation. Training of all models was unsupervised. The RNA data set Data2_standardized_37PCs.csv contains the projection of the cleaned and filtered data onto the subspace of the first 37 principal components. To obtain the raw data please contact Jun Zhao at jun.zhao@yale.edu.

All the models used to produce the results in the manuscript are saved in savedModels.

Any questions should be referred to Uri Shaham, uri.shaham@yale.edu.

Update April 7th 2020: A PyTorch implementation of the code can be found in https://github.com/ushaham/batchEffectRemoval2020.