Open evaesquinas opened 2 years ago
@evaesquinas Hi, I've tried your example data and it took me roughly 13 hours to finish the adjustment of 3 batches. I hope this can help you.
@evaesquinas hi, did you manage to figure out what is the best way to deal with the adjustment of multiple batches? Have you tried any other programs for this task? Faced the same problem on my data and still can't continue the analysis as the adjustment takes too long and seems to be endless. I would be grateful for any help and advice
@maxnest Hi! No, I did not sorry. I ended adjusting only by 1 batch. And I did not find any other way to solve the problem. Sorry.
Dear @zhangyuqing ,
I am trying to adjust my RNA data using ComBat-Seq since I realised that there are 3 batches that I need to adjust for:
I have 960 samples and around 62000 genes.
In my biological matrix, I have: Age, Sex, Group (cases,controls..) and WBC counts.
biol_mat = model.matrix(~Age + as.factor(Sex) + as.factor(Group) + LYMPH + MONO + NEUT, data=phenotype)
In the tutorial of
Combat-Seq
appears how to adjust by 1 variable but it doesn't tell you how to adjust by more than 1.I have seen a lot of posts using
combat
that the only way is to adjust by 1 variable and then, with those results, adjust again by the 2nd variable and so on.That would be:
Adjust by library prep.
Adjust by library prep + type of tube.
Adjust by library prep + type of tube + place
For the first adjustment (library prep) it takes around 15min, but for the second... it has been running for more than 2 days.. I stopped it and launch it again, changing the order of the adjustment but it is taking too much time and it seems that something is wrong...
Here I attach a similar example of my data:
Covariates (variables to adjust + biological information)
The 3 variables that I want to adjust for will be: Place, Tubes and LibPrep. On the other hand, the biological information that I want to preserve would be Age, Group, Sex and WBC.
RNA raw counts
The biological matrix would be:
biol_mat = model.matrix(~Age + as.factor(Sex) + as.factor(Group) + WBC, data=covariates_df)
Could you kindly help me, please? I have tried all the possibilities and I do not really know if it is something wrong my code or it is because
combat_seq
cannot handle too much amount of data.Thanks very much in advance Regards