Open Yecats77 opened 6 months ago
Hey
Your run on the example is correct.
The `subtype` column was meant to further classify the `type` column. A missing value (`nan`) in the `subtype` column means that the `type` result cannot be further divided into subgroups, as is the case for the reorganized TADs that are `loss` in condition 2.
Thanks for your reply!
I now understand what the `subtype` column means.
So, in example 2, the results mean that no types of reorganized TADs are identified under condition 2, since all elements of the `type` column under condition 2 are `nan`. Is this understanding correct?
And may I ask how you obtained the number in "DiffDomain identifies that 30.771% of GM12878 TADs are reorganized in K562" from the results in `adjusted_TADs2.txt_types.txt`?
Thank you.
Best regards, Stacey LIU
2. To get the proportion of TADs that are reorganized in condition 2, we need to count two numbers.
1) The number of reorganized TADs
The `significant` column in `adjusted_TADs2.txt_types.txt` indicates whether a TAD is significantly reorganized, with `1` standing for significant, `0` for not significant, and `nan` for condition 2 TADs. Counting the number of `1` values gives the number of reorganized TADs.
2) The number of TADs
Count 'condition1' in the `origin` column.
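The two counts described above could be computed roughly as follows. This is only a sketch: it assumes the output file is tab-delimited with a header row naming the `origin` and `significant` columns, and the helper name `reorganized_proportion` is hypothetical, not part of DiffDomain. The demo rows are made up for illustration and are not taken from the real output.

```python
import csv
import io
import math


def reorganized_proportion(rows):
    """Return (n_reorganized, n_condition1, proportion) from an iterable of row dicts."""
    n_reorganized = 0
    n_condition1 = 0
    for row in rows:
        # Count 2: every row whose origin is 'condition1' is a condition 1 TAD.
        if row["origin"].strip() == "condition1":
            n_condition1 += 1
        # Count 1: significant == 1 marks a reorganized TAD.
        try:
            value = float(row["significant"])
        except ValueError:
            continue  # skip cells that are not numeric
        if not math.isnan(value) and value == 1:
            n_reorganized += 1
    proportion = n_reorganized / n_condition1 if n_condition1 else float("nan")
    return n_reorganized, n_condition1, proportion


# Made-up rows (assumed tab-delimited layout), for illustration only.
demo = io.StringIO(
    "origin\tsignificant\n"
    "condition1\t1\n"
    "condition1\t0\n"
    "condition1\t1.0\n"
    "condition1\t0\n"
    "condition2\tnan\n"
)
counts = reorganized_proportion(csv.DictReader(demo, delimiter="\t"))
print(counts)
```

To run this on the real file, open `adjusted_TADs2.txt_types.txt` and pass `csv.DictReader(fh, delimiter="\t")` to the function, adjusting the delimiter if the file turns out to be formatted differently.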
Hi,
I tried your example 2. The commands and the output results (adjusted_TADs2.txt_types.txt) are attached.
In the results, I see that for some regions there are only condition 1 entries, whose type is loss, while for other regions the type of the condition 2 entries is always nan. I am not sure what happened. May I ask whether I made any mistakes when using the DiffDomain tools? Could you please help with this problem?
Thank you.
Best regards, Stacey LIU
adjusted_TADs2.txt adjusted_TADs2.txt_types.txt