Closed abhisheksinghnl closed 12 months ago
Hi there,
Thank you for your report. We'll try to reproduce the issue reported and get back to you ASAP. What's your output of packageVersion("DIscBIO")?
Hi There,
Thank you for your reply.
The package version is 1.2.0
Looking forward to using 10x data.
Thank you
Hi there,
The problem seems to be caused directly by the presence of Inf values in DataSet. Most values in there look fine to me (see summary(DataSet)), but the presence of a few infinite values (check table(DataSet == Inf)) makes colSums() return Inf.
Since na.rm only removes NAs, not Infs, the following code works around the issue:
DataSet[DataSet == Inf] <- NA # maybe instead of NA, use a meaningful but finite value
S1 <- summary(colSums(DataSet, na.rm=TRUE)) # gives an idea about the number of reads across cells
My output after that was:
> print(S1)
Min. 1st Qu. Median Mean 3rd Qu. Max.
2.886e+236 2.685e+284 3.313e+293 9.480e+302 5.593e+302 5.903e+303
I don't know if this makes sense methodologically, though.
Looking forward to your feedback!
Reproducible example available here: https://gist.github.com/wleoncio/a3f04d3bbe4ec89a1966c1b3c5be527e
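For anyone who wants to see the behaviour without downloading the data, here is a minimal sketch on a toy data frame. The object names (toy, m, before, fixed) and the values are made up for illustration and are not taken from the liver dataset. It only shows that colSums(..., na.rm = TRUE) skips NA but still propagates Inf, and that is.infinite() on a matrix also catches -Inf, which DataSet == Inf would miss.

```r
# Toy data standing in for DataSet (hypothetical values, not the real matrix)
toy <- data.frame(
  cell1 = c(1, 2, Inf),
  cell2 = c(4, NA, 6),
  cell3 = c(7, 8, 9)
)

# na.rm = TRUE drops the NA in cell2 but keeps the Inf in cell1,
# so the sum for cell1 is still Inf
before <- colSums(toy, na.rm = TRUE)
print(before)

# is.infinite() has no data.frame method, so work on a numeric matrix;
# it flags both Inf and -Inf
m <- as.matrix(toy)
n_infinite <- sum(is.infinite(m))
print(n_infinite)

# Replace the infinite entries with NA, then sum again
m[is.infinite(m)] <- NA
fixed <- colSums(m, na.rm = TRUE)
print(fixed)
```

Whether dropping those entries is appropriate still depends on how the infinite values arose in the first place, so this is a diagnostic aid rather than a recommended preprocessing step.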
Hi @abhisheksinghnl,
Could you please confirm if the solution presented above (https://github.com/ocbe-uio/DIscBIO/issues/41#issuecomment-1185254486) solves your issue?
Thanks!
Given the lack of feedback, I'll just assume the proposed solution fixes the issue. Please reopen if the problem persists.
Dear Author,
Thank you for the excellent tool. I have tested the tool on the test dataset of CTC and it works flawlessly. I then tried to use it on a 10x dataset from GSE136103. It has 10 healthy and 10 diseased samples.
I have processed these using Seurat as follows
Then I computed the count matrix per sample by taking the average, as below
Then I use DIscBIO pipeline as outlined for further analyses
library(DIscBIO)
Loading Dataset
FileName <- "liver"
DataSet <- read.csv(file = paste0(FileName, ".csv"), sep = ",", header = TRUE)
rownames(DataSet) <- DataSet[, 1]
DataSet <- DataSet[, -1]
cat(paste0("The ", FileName, " contains:", "\n", "Genes: ", length(DataSet[, 1]), "\n", "cells: ", length(DataSet[1, ]), "\n"))
sc <- DISCBIO(DataSet)
In the next step everything becomes infinity
I am not sure where I am going wrong or how to fix this. Also, can DIscBIO handle 10x data, or is it only meant for FACS-sorted and SMART-seq data?
Could you please guide me and suggest how to fix this problem?
Thank you