Closed Yaoyx closed 7 months ago
View / edit / reply to this conversation on ReviewNB
gfudenberg commented on 2024-02-14T00:29:45Z ----------------------------------------------------------------
Using adjacent boundaries to create a table of TADs
View / edit / reply to this conversation on ReviewNB
gfudenberg commented on 2024-02-14T00:29:45Z ----------------------------------------------------------------
Calling TADs from Hi-C data poses a challenge, in part because domain structures vary greatly in their size, intensity, and can be nested. The number of called TADs varies substantially from tool to tool, and can depend on tool-specific parameters (Forcato, 2017). Below, we show an example of how adjacent boundaries calculated with cooltools can specify a set of intervals that can be analyzed as TADs.
View / edit / reply to this conversation on ReviewNB
gfudenberg commented on 2024-02-14T00:29:46Z ----------------------------------------------------------------
Line #10. chrom += strong_boundaries.chrom.to_list()
could we do this in a pandas-native fashion rather than converting to list? e.g. via concat
View / edit / reply to this conversation on ReviewNB
gfudenberg commented on 2024-02-14T00:29:47Z ----------------------------------------------------------------
should show Hi-C data, either just below the shadow TADs or instead just highighting positions of TADs as light black lines over the Hi-C data
View / edit / reply to this conversation on ReviewNB
gfudenberg commented on 2024-02-14T00:29:48Z ----------------------------------------------------------------
Line #5. contact_matrix[int(row['start']/resolution):int(row['end']/resolution), int(row['start']/resolution):int(row['end']/resolution)] = 1
# can add a comment about visualizing the first 10 inter-boundary intervals vs. the data
c/o @Phlya : for insulation_and_boundaries.ipynb
We should include a cutoff on TAD size & rename process_chromosome
to something like convert_boundaries_to_TADs()
. He also has an alternate implementation in quaich that he can link here.
also c/o @Phlya : for contacts_vs_distance.ipynb
.
number of counts reported should match those in the curve (e.g. calculate just the number of contacts in the arm), and should make sure the aggregation is over the same number of regions in both raw & smoothed.
https://github.com/open2c/quaich/blob/0dc4936b569f9709ef7a4ba0ef8165519a845d20/workflow/rules/insulation_tads.smk#L71 link to the function to get TADs from insulation peaks
Check out this pull request on
See visual diffs & provide feedback on Jupyter Notebooks.
Powered by ReviewNB