Open bhristov6 opened 6 months ago
Hi,
Can you do a manual installation (option 2) and try it again?
ok, will do and let you know.
Also, I noticed you're trying to run the trans-calling part. Have you tried to run the cis-calling part?
When running with the cis-calling part it generated file for most chromosomes but threw this error:
Writing chr22 .txt file Calculating expected counts from chromosome wise background dist Weight 1 0 15655 2 100000 386 3 200000 261 4 300000 201 5 400000 264 6 500000 131 A B Weight chr1 pos1 chr2 pos2 dist WeightOE 1: 30428 30428 23 chr21 100000 chr21 100000 0 0.6875759 2: 30428 30429 33 chr21 100000 chr21 200000 100000 39.9248705 3: 30428 30430 25 chr21 100000 chr21 300000 200000 44.6360153 4: 30428 30431 56 chr21 100000 chr21 400000 300000 129.5522388 5: 30428 30432 21 chr21 100000 chr21 500000 400000 36.9090909 6: 30428 30434 27 chr21 100000 chr21 700000 600000 235.3584906 [1] 147 [1] 100000 [1] 147 Writing chr21 .txt file Calculating expected counts from chromosome wise background Error in aggregate.data.frame(lhs, mf[-1L], FUN = FUN, ...) : no rows to aggregate Calls: lapply ... aggregate -> aggregate.formula -> aggregate.data.frame Execution halted
Few tips -
When you're re-running, please delete the old folder. Otherwise, it may generate some errors. If you haven't, please delete the old folders.
Please remove instances of non-standard chromosomes like chrY and chrM from the bed files. Due to a lack of interactions and small size, it may not find the bins to aggregate and can throw errors like this.
I followed your tips and now it runs smoothly on the cis-calling part but still segfaults on the trans-calling:
caught segfault address (nil), cause 'unknown'
Traceback: 1: functionsdchic::createtransijk(matpath, bedpath, paste0(outpath, "/", chr[l], ".inter.txt"), as.character(chr[l])) 2: FUN(X[[i]], ...) 3: lapply(1:length(chrom), extractTrans, chrom, normalizePath(df$mat[i]), normalizePath(df$bed[i]), paste0(df$prefix[i], "_pca/", "inter_pca/", df$prefix[i], "_mat")) 4: FUN(X[[i]], ...) 5: lapply(1:nrow(data), readfilesinter, data, pc, cthread, pthread, rowmrge, dirovwt) An irrecoverable exception occurred. R is aborting now ...
What resolution are you running the trans calls?
100kb
Do you see any file under inter_pca/<prefix>_trans_mat
?
I need to know if any specific chromosome is causing the issue.
yes, there is a file created for the fist chromosome chr1.inter.txt which is about 450MB.
I suggested you the following on May 9th -
"Please remove instances of non-standard chromosomes like chrY and chrM from the bed files. Due to a lack of interactions and small size, it may not find the bins to aggregate and can throw errors like this."
Did you remove the non-standard chromosome from the bed files?
If it is yes then it's good for the cis-calling part only.
I think for trans we need to filter the matrix files too.
During cis calculation, dchic is only looking at the intra matrices.
However, during trans mode dchic is also looking at either chrM or chrY.
And if any of the chromosomes is smaller than 100Kb and/or has very few counts it can generate a segmentation fault.
Yes, I have removed the non-standard chromosomes and I'm using only chr1-22.
Is it possible to share the data? Or at least a low-resolution like 1Mb data? I would like to reproduce the issue! abhiijit@lji.org
Is there a resolution for this issue? I am running the same "segfault" error when "--pcatype trans".
Best,
What resolution are you running this? I need to recreate the issue to resolve this.
Hello,
When I tried with different resolutions and provided in the same input data, the issue occurs. When only using one resolution, the issue is resolved. I think the input data for all the samples have to be the same resolution.
I do have an additional question about how to decide an optimal resolution for compartment call. Could you clarify?
Thank you for your help.
Best,
If you can raise a new issue regarding the optimal resolution, it would be great.
Thanks.
Hi, I installed dcHiC via the conda option and tried to call compartments in trans. After calculating the pca on chr1 the program segfaut-ed.
Written exp1_do_pca/inter_pca/exp1_do_mat/chr1.inter.txt
caught segfault address (nil), cause 'unknown'
Traceback: 1: functionsdchic::createtransijk(matpath, bedpath, paste0(outpath, "/", chr[l], ".inter.txt"), as.character(chr[l])) 2: FUN(X[[i]], ...) 3: lapply(1:length(chrom), extractTrans, chrom, normalizePath(df$mat[i]), normalizePath(df$bed[i]), paste0(df$prefix[i], "_pca/", "inter_pca/", df$prefix[i], "_mat")) 4: FUN(X[[i]], ...) 5: lapply(1:nrow(data), readfilesinter, data, pc, cthread, pthread, rowmrge, dirovwt) An irrecoverable exception occurred. R is aborting now ...