stjude-biohackathon / CRCminer

MIT License
2 stars 1 forks source link

Output all network nodes and edges. #7

Open j-andrews7 opened 1 year ago

j-andrews7 commented 1 year ago

In some sort of format that makes sense. For now, going to copy the CRC program output formats so that downstream reporting/viz can be used with both. File names from the example data/run that I used are in parentheses.

This represents ARNTL motifs found in enhancers (and/or promoters?) of the genes in the "TARGET" column. This can be used to create dynamically network plots from one or more TFs as selected by the user, optionally limiting to TF-TF interactions as designated in the last column (1 indicates that it's a TF-TF regulatory interaction).

Last column is gene assignments. I kind of hate this format, switching to a BED-like format would be easier to worth with. Maybe add another column for whether it's a TF or not, which is currently subsetted into the (A549_CRCs_ENHANCER_TF_TABLE.txt) file.

I feel it may be worth having two enhancer columns - one for super enhancers and one for "typical" enhancers.

This is still a WIP, will add more in a bit.