Closed dawe closed 3 years ago
Hi,
you get this error if you try to access a region that is not in the matrix. Therefore your statement interval at line n+1 is perfectly defined interval on the chromosome
makes me wonder. Can you validate this for both matrices? Especially if the regions come from a different algorithm. Was the identical
interaction matrix used to compute it? Arrowhead is imho from Juicer software and they use not the cool, but the hic format. Maybe the data was mapped to a different reference genome, leading to issues especially at the end of a chromosome?
Best,
Joachim
Indeed those regions were found with Juicer on a .hic file. I converted the .hic to .cool, plus the offending lines aren’t at the end of the chromosome. Also, running only on the offending line does not raise the error.
I see. Is that a public dataset? Or is there the chance you provide me with a sample of where this happens? I could investigate in detail what is going on.
I wish I could share these data with you but I don't own them. I guess I'll try to debug them and I'll come back when (and if) I can solve this isse.
Just to be clear, hicDifferentialTAD
only reads given coordinates in both matrices files, and does not expect anything but coordinates, right? Is there the chance that columns 4,5,6 are used?
I think I understood what's happening here. TAD definitions from juicer/arrowhead include some overlapping entries, such as:
1 39310000 39550000
1 39320000 39450000
In which the second interval is included in the first one. So, when hicDifferentialTAD
takes the coordinates of the second one to test the inter-rightTAD
, the coordinates are out of frame.
@dawe Can you test if it is fixed in the current develop branch?
@joachimwolff Sorry, I was out of office. I'm back now and testing the fix, it seems to be working smoothly.
Welcome to the HiCExplorer GitHub repository! Before opening the issue please check that the following requirements are met :
[x] Search whether this issue (or a similar issue) has been solved before using the search tab above. Link the previous issue if appropriate below.
[x] Paste your HiCExplorer version (
hicInfo --version
) and your python version (python --version
) below.conda create --name hicexplorer hicexplorer=3.6 python=3.8 -c bioconda -c conda-forge
and activate the environment:conda activate hicexplorer
. Retry your command. You can exit a conda environment viaconda deactivate
. To learn more about conda and environments, please consider the following documentation.Retry your command, is it solved now? If not please continue with the following:
Hi all, I have been trying to use
hicDifferentialTAD
on some data of mine. First of all, it works fine when I feed it with domains.bed produced withhicFindTADs
, so I guess it is not an issue with the data themselves. I'd like to test TAD produced by some collaborators (defined with ArrowHead), to do so I just take their file and produce a 6-columns file to feedhicDifferentialTAD
. I get the error pasted above. I tried to isolate the offending line and I found that, for example,hicFindTADs
I tried to use pdb to debug this, but I'm having some issues with queues. Any idea on how to manage this issue? Side question: is it possible to use any set of given intervals as input for
hicDifferentialTAD
(assuming the intervals make sense, of course).