In some sort of format that makes sense. For now, going to copy the CRC program output formats so that downstream reporting/viz can be used with both. File names from the example data/run that I used are in parentheses.
[x] List of all node genes (genes marked by a (super)enhancer).
[x] List of all TF node genes (TF genes marked by a superenhancer).
This represents ARNTL motifs found in enhancers (and/or promoters?) of the genes in the "TARGET" column. This can be used to create dynamically network plots from one or more TFs as selected by the user, optionally limiting to TF-TF interactions as designated in the last column (1 indicates that it's a TF-TF regulatory interaction).
[ ] (Super) enhancer BED files (A549_CRCs_ENHANCER_TABLE.txt). Currently, this format looks like:
Last column is gene assignments. I kind of hate this format, switching to a BED-like format would be easier to worth with. Maybe add another column for whether it's a TF or not, which is currently subsetted into the (A549_CRCs_ENHANCER_TF_TABLE.txt) file.
[ ] List of self loops (A549_CRCs_SELF_LOOPS.txt), these are just TFs that have a motif in one of their own enhancers.
SOX2
JUN
EGR1
[ ] List of genes and their associated enhancers, and their TF designation (A549_CRCs_GENE_SUMMARY.txt). In the example data, it's only the SE-associated genes as the SEs were the only enhancers I provided:
In some sort of format that makes sense. For now, going to copy the CRC program output formats so that downstream reporting/viz can be used with both. File names from the example data/run that I used are in parentheses.
This represents ARNTL motifs found in enhancers (and/or promoters?) of the genes in the "TARGET" column. This can be used to create dynamically network plots from one or more TFs as selected by the user, optionally limiting to TF-TF interactions as designated in the last column (1 indicates that it's a TF-TF regulatory interaction).
Last column is gene assignments. I kind of hate this format, switching to a BED-like format would be easier to worth with. Maybe add another column for whether it's a TF or not, which is currently subsetted into the (A549_CRCs_ENHANCER_TF_TABLE.txt) file.
[ ] List of self loops (A549_CRCs_SELF_LOOPS.txt), these are just TFs that have a motif in one of their own enhancers.
[ ] List of genes and their associated enhancers, and their TF designation (A549_CRCs_GENE_SUMMARY.txt). In the example data, it's only the SE-associated genes as the SEs were the only enhancers I provided:
I feel it may be worth having two enhancer columns - one for super enhancers and one for "typical" enhancers.
This is still a WIP, will add more in a bit.