DNA_jets contains the code used to characterise jets. Authors: Rosalba Garcia-Millan and Gunnar Pruessner
List of files: gplist_util.h gplist_util2.h data_structure.h gplist_util.c gplist_util2.c DNAsliding_window.c DNA_expected.c DNA_panoramic.c DNA_stencil.c panoramic_mode.c pseudoBAMfile.c
Compilation
cc -o DNAsliding_window DNAsliding_window.c gplist_util.c
cc -O3 -Wall -o BAM pseudoBAMfile.c gplist_util2.c -lm -lgsl -lgslcblas
cc -O3 -Wall -o exp DNA_expected.c gplist_util2.c -lm -lgsl -lgslcblas
cc -O3 -Wall -o Jets DNA_panoramic.c gplist_util2.c rgm_Gfit.c -lm -lgsl -lgslcblas
cc -O3 -Wall -o max panoramic_mode.c -lm -lgsl -lgslcblas
cc -O3 -Wall -o JetsSten DNA_stencil.c gplist_util2.c -lm -lgsl -lgslcblas
pseudoBAMfile: Generate file with total counts for each position x (in case BAM files are not provided).
DNA_expected: Generate file with the average count for a given distance between two positions (in case expected data is not provided).
DNA_panoramic: Generate panoramic curves.
panoramic_mode: Characterise orientation, and strength of each jet.
DNA_stencil: Generate stencil curves, characterise distances reached by the jet.
While running the pipeline above, the following path directories in the header file data_structure.h need to be updated.
bed_file: contains chromosome locations to be analysed
panoramic_file: contains panoramic curves output from DNA_panoramic.
jet_analysis_file: contains output from panoramic_mode.
jet_consensus_file: list of locations considered to have a jet.
jet_reach_file: contains output from DNA_stencil.