Closed KunFang93 closed 4 years ago
Hi, Yaqiang
I found that hicpro2bedpe is not working properly as the code misunderstands the meaning of 11th and 12th col. They are not read length but the mapping quality. Please check the this website:https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY.
Best, Kun
Hi Kun, Thank you very much for the reminding. Sorry for the carelessness. I have fix the problem, please refer to https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe for updated code (line 17-18). Please let me kown if you have found other problems. Have a nice day. Thank you again. Best, Yaqiang
Hi, Yaqiang
Thanks for your promptly reply and fixation!
Best, Kun
Sent from my iPhone
On Sep 14, 2020, at 10:00, YaqiangCao notifications@github.com wrote:
Hi, Yaqiang
I found that hicpro2bedpe is not working properly as the code misunderstands the meaning of 11th and 12th col. They are not read length but the mapping quality. Please check the this website:https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY.
Best, Kun
Hi Kun, Thank you very much for the reminding. Sorry for the carelessness. I have fix the problem, please refer to https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe for updated code (line 17-18). Please let me kown if you have found other problems. Have a nice day. Thank you again. Best, Yaqiang
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or unsubscribe.
Hi, Yaqiang
Sorry for bothering but when I am trying to call hic loops of gsm1631184, I only get 2 loops. The hic data is processed by hicpro with the attached config file and then use the new hicpropairs2bedpe. I was wondering if there is any way to debug where goes wrong? Thanks!
Best, Kun
On Sep 14, 2020, at 10:00 AM, YaqiangCao notifications@github.com wrote:
Hi, Yaqiang
I found that hicpro2bedpe is not working properly as the code misunderstands the meaning of 11th and 12th col. They are not read length but the mapping quality. Please check the this website:https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY.
Best, Kun
Hi Kun, Thank you very much for the reminding. Sorry for the carelessness. I have fix the problem, please refer to https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe for updated code (line 17-18). Please let me kown if you have found other problems. Have a nice day. Thank you again. Best, Yaqiang
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692115131, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADXLIA3HMMOTIFWXWLMYCWLSFYVZFANCNFSM4RLVVTPA.
#########################################################################
#########################################################################
TMP_DIR = tmp LOGS_DIR = log BOWTIE2_OUTPUT_DIR = bowtie_results MAPC_OUTPUT = hic_results RAW_DIR = rawdata
#######################################################################
####################################################################### N_CPU = 25 LOGFILE = hicpro.log
JOB_NAME = Jingwei_HiC JOB_MEM = 20gb JOB_WALLTIME = 600:00:00 JOB_QUEUE = JOB_MAIL = fangk@livemail.uthscsa.edu
#########################################################################
#########################################################################
PAIR1_EXT = _R1 PAIR2_EXT = _R2
#######################################################################
#######################################################################
MIN_MAPQ = 10
BOWTIE2_IDX_PATH = /data/kun/Softwares/hiclib/hiclib-legacy/bin/bowtie2/index BOWTIE2_GLOBAL_OPTIONS = --very-sensitive -L 30 --score-min L,-0.6,-0.2 --end-to-end --reorder BOWTIE2_LOCAL_OPTIONS = --very-sensitive -L 20 --score-min L,-0.6,-0.2 --end-to-end --reorder
#######################################################################
#######################################################################
REFERENCE_GENOME = hg19 GENOME_SIZE = /data/kun/Jingwei/annotation/chrom_hg19.sizes
#######################################################################
#######################################################################
ALLELE_SPECIFIC_SNP =
#######################################################################
#######################################################################
CAPTURE_TARGET = REPORT_CAPTURE_REPORTER = 1
#######################################################################
#######################################################################
GENOME_FRAGMENT = /data/kun/Jingwei/annotation/HindIII_resfrag_hg19.bed LIGATION_SITE = AAGCTAGCTT MIN_FRAG_SIZE = 100 MAX_FRAG_SIZE = 100000 MIN_INSERT_SIZE = 100 MAX_INSERT_SIZE = 800
#######################################################################
#######################################################################
MIN_CIS_DIST = GET_ALL_INTERACTION_CLASSES = 1 GET_PROCESS_SAM = 0 RM_SINGLETON = 1 RM_MULTI = 1 RM_DUP = 1
#######################################################################
#######################################################################
BIN_SIZE = 20000 40000 150000 MATRIX_FORMAT = upper
#######################################################################
####################################################################### MAX_ITER = 100 FILTER_LOW_COUNT_PERC = 0.02 FILTER_HIGH_COUNT_PERC = 0 EPS = 0.1
Hi Kun, Could you please share me a small chromosome data (such as chr21) for HiC-Pro output or converted BEDPE file. I will check the data. Best, Yaqiang
On Mon, Sep 14, 2020 at 4:08 PM KunFang notifications@github.com wrote:
Hi, Yaqiang
Sorry for bothering but when I am trying to call hic loops of gsm1631184, I only get 2 loops. The hic data is processed by hicpro with the attached config file and then use the new hicpropairs2bedpe. I was wondering if there is any way to debug where goes wrong? Thanks!
Best, Kun
On Sep 14, 2020, at 10:00 AM, YaqiangCao notifications@github.com wrote:
Hi, Yaqiang
I found that hicpro2bedpe is not working properly as the code misunderstands the meaning of 11th and 12th col. They are not read length but the mapping quality. Please check the this website: https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY < https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY>.
Best, Kun
Hi Kun, Thank you very much for the reminding. Sorry for the carelessness. I have fix the problem, please refer to https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe < https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe> for updated code (line 17-18). Please let me kown if you have found other problems. Have a nice day. Thank you again. Best, Yaqiang
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub < https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692115131>, or unsubscribe < https://github.com/notifications/unsubscribe-auth/ADXLIA3HMMOTIFWXWLMYCWLSFYVZFANCNFSM4RLVVTPA .
Please change the variable settings below if necessary
#########################################################################
Paths and Settings - Do not edit !
#########################################################################
TMP_DIR = tmp LOGS_DIR = log BOWTIE2_OUTPUT_DIR = bowtie_results MAPC_OUTPUT = hic_results RAW_DIR = rawdata
#######################################################################
SYSTEM AND SCHEDULER - Start Editing Here !!
####################################################################### N_CPU = 25 LOGFILE = hicpro.log
JOB_NAME = Jingwei_HiC JOB_MEM = 20gb JOB_WALLTIME = 600:00:00 JOB_QUEUE = JOB_MAIL = fangk@livemail.uthscsa.edu
#########################################################################
Data
#########################################################################
PAIR1_EXT = _R1 PAIR2_EXT = _R2
#######################################################################
Alignment options
#######################################################################
MIN_MAPQ = 10
BOWTIE2_IDX_PATH = /data/kun/Softwares/hiclib/hiclib-legacy/bin/bowtie2/index BOWTIE2_GLOBAL_OPTIONS = --very-sensitive -L 30 --score-min L,-0.6,-0.2 --end-to-end --reorder BOWTIE2_LOCAL_OPTIONS = --very-sensitive -L 20 --score-min L,-0.6,-0.2 --end-to-end --reorder
#######################################################################
Annotation files
#######################################################################
REFERENCE_GENOME = hg19 GENOME_SIZE = /data/kun/Jingwei/annotation/chrom_hg19.sizes
#######################################################################
Allele specific analysis
#######################################################################
ALLELE_SPECIFIC_SNP =
#######################################################################
Capture Hi-C analysis
#######################################################################
CAPTURE_TARGET = REPORT_CAPTURE_REPORTER = 1
#######################################################################
Digestion Hi-C
#######################################################################
GENOME_FRAGMENT = /data/kun/Jingwei/annotation/HindIII_resfrag_hg19.bed LIGATION_SITE = AAGCTAGCTT MIN_FRAG_SIZE = 100 MAX_FRAG_SIZE = 100000 MIN_INSERT_SIZE = 100 MAX_INSERT_SIZE = 800
#######################################################################
Hi-C processing
#######################################################################
MIN_CIS_DIST = GET_ALL_INTERACTION_CLASSES = 1 GET_PROCESS_SAM = 0 RM_SINGLETON = 1 RM_MULTI = 1 RM_DUP = 1
#######################################################################
Contact Maps
#######################################################################
BIN_SIZE = 20000 40000 150000 MATRIX_FORMAT = upper
#######################################################################
Normalization
####################################################################### MAX_ITER = 100 FILTER_LOW_COUNT_PERC = 0.02 FILTER_HIGH_COUNT_PERC = 0 EPS = 0.1
— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692285321, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAOPQKOJAKIDM6TE7VWCHF3SFZZ2NANCNFSM4RLVVTPA .
Hi, Yaqiang
I just got the reply from the author of HiC-Pro. The position of allValidPairs now changed back to 5’ end in the latest version (https://github.com/nservant/HiC-Pro/issues/362).
Best, Kun
On Sep 14, 2020, at 3:53 PM, YaqiangCao notifications@github.com wrote:
Hi Kun, Could you please share me a small chromosome data (such as chr21) for HiC-Pro output or converted BEDPE file. I will check the data. Best, Yaqiang
On Mon, Sep 14, 2020 at 4:08 PM KunFang notifications@github.com wrote:
Hi, Yaqiang
Sorry for bothering but when I am trying to call hic loops of gsm1631184, I only get 2 loops. The hic data is processed by hicpro with the attached config file and then use the new hicpropairs2bedpe. I was wondering if there is any way to debug where goes wrong? Thanks!
Best, Kun
On Sep 14, 2020, at 10:00 AM, YaqiangCao notifications@github.com wrote:
Hi, Yaqiang
I found that hicpro2bedpe is not working properly as the code misunderstands the meaning of 11th and 12th col. They are not read length but the mapping quality. Please check the this website: https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY < https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY>.
Best, Kun
Hi Kun, Thank you very much for the reminding. Sorry for the carelessness. I have fix the problem, please refer to https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe < https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe> for updated code (line 17-18). Please let me kown if you have found other problems. Have a nice day. Thank you again. Best, Yaqiang
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub < https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692115131>, or unsubscribe < https://github.com/notifications/unsubscribe-auth/ADXLIA3HMMOTIFWXWLMYCWLSFYVZFANCNFSM4RLVVTPA .
Please change the variable settings below if necessary
#########################################################################
Paths and Settings - Do not edit !
#########################################################################
TMP_DIR = tmp LOGS_DIR = log BOWTIE2_OUTPUT_DIR = bowtie_results MAPC_OUTPUT = hic_results RAW_DIR = rawdata
#######################################################################
SYSTEM AND SCHEDULER - Start Editing Here !!
####################################################################### N_CPU = 25 LOGFILE = hicpro.log
JOB_NAME = Jingwei_HiC JOB_MEM = 20gb JOB_WALLTIME = 600:00:00 JOB_QUEUE = JOB_MAIL = fangk@livemail.uthscsa.edu
#########################################################################
Data
#########################################################################
PAIR1_EXT = _R1 PAIR2_EXT = _R2
#######################################################################
Alignment options
#######################################################################
MIN_MAPQ = 10
BOWTIE2_IDX_PATH = /data/kun/Softwares/hiclib/hiclib-legacy/bin/bowtie2/index BOWTIE2_GLOBAL_OPTIONS = --very-sensitive -L 30 --score-min L,-0.6,-0.2 --end-to-end --reorder BOWTIE2_LOCAL_OPTIONS = --very-sensitive -L 20 --score-min L,-0.6,-0.2 --end-to-end --reorder
#######################################################################
Annotation files
#######################################################################
REFERENCE_GENOME = hg19 GENOME_SIZE = /data/kun/Jingwei/annotation/chrom_hg19.sizes
#######################################################################
Allele specific analysis
#######################################################################
ALLELE_SPECIFIC_SNP =
#######################################################################
Capture Hi-C analysis
#######################################################################
CAPTURE_TARGET = REPORT_CAPTURE_REPORTER = 1
#######################################################################
Digestion Hi-C
#######################################################################
GENOME_FRAGMENT = /data/kun/Jingwei/annotation/HindIII_resfrag_hg19.bed LIGATION_SITE = AAGCTAGCTT MIN_FRAG_SIZE = 100 MAX_FRAG_SIZE = 100000 MIN_INSERT_SIZE = 100 MAX_INSERT_SIZE = 800
#######################################################################
Hi-C processing
#######################################################################
MIN_CIS_DIST = GET_ALL_INTERACTION_CLASSES = 1 GET_PROCESS_SAM = 0 RM_SINGLETON = 1 RM_MULTI = 1 RM_DUP = 1
#######################################################################
Contact Maps
#######################################################################
BIN_SIZE = 20000 40000 150000 MATRIX_FORMAT = upper
#######################################################################
Normalization
####################################################################### MAX_ITER = 100 FILTER_LOW_COUNT_PERC = 0.02 FILTER_HIGH_COUNT_PERC = 0 EPS = 0.1
— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692285321, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAOPQKOJAKIDM6TE7VWCHF3SFZZ2NANCNFSM4RLVVTPA .
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692308254, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADXLIA2CRZDF3A5GDWG65JTSFZ7GDANCNFSM4RLVVTPA.
Hi Kun, Thank you for the reminder. Sorry for the late reply, a little busy for other projects these days. I may update the script later. May I ask if you loop-calling goes well for your data with cLoops? Best, Yaqiang
On Wed, Sep 16, 2020 at 11:35 AM KunFang notifications@github.com wrote:
Hi, Yaqiang
I just got the reply from the author of HiC-Pro. The position of allValidPairs now changed back to 5’ end in the latest version ( https://github.com/nservant/HiC-Pro/issues/362).
Best, Kun
On Sep 14, 2020, at 3:53 PM, YaqiangCao notifications@github.com wrote:
Hi Kun, Could you please share me a small chromosome data (such as chr21) for HiC-Pro output or converted BEDPE file. I will check the data. Best, Yaqiang
On Mon, Sep 14, 2020 at 4:08 PM KunFang notifications@github.com wrote:
Hi, Yaqiang
Sorry for bothering but when I am trying to call hic loops of gsm1631184, I only get 2 loops. The hic data is processed by hicpro with the attached config file and then use the new hicpropairs2bedpe. I was wondering if there is any way to debug where goes wrong? Thanks!
Best, Kun
On Sep 14, 2020, at 10:00 AM, YaqiangCao notifications@github.com wrote:
Hi, Yaqiang
I found that hicpro2bedpe is not working properly as the code misunderstands the meaning of 11th and 12th col. They are not read length but the mapping quality. Please check the this website: https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY < https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY>.
Best, Kun
Hi Kun, Thank you very much for the reminding. Sorry for the carelessness. I have fix the problem, please refer to
https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe <
https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe
for updated code (line 17-18). Please let me kown if you have found other problems. Have a nice day. Thank you again.
Best, Yaqiang
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub < https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692115131 , or unsubscribe <
https://github.com/notifications/unsubscribe-auth/ADXLIA3HMMOTIFWXWLMYCWLSFYVZFANCNFSM4RLVVTPA
.
Please change the variable settings below if necessary
#########################################################################
Paths and Settings - Do not edit !
#########################################################################
TMP_DIR = tmp LOGS_DIR = log BOWTIE2_OUTPUT_DIR = bowtie_results MAPC_OUTPUT = hic_results RAW_DIR = rawdata
#######################################################################
SYSTEM AND SCHEDULER - Start Editing Here !!
####################################################################### N_CPU = 25 LOGFILE = hicpro.log
JOB_NAME = Jingwei_HiC JOB_MEM = 20gb JOB_WALLTIME = 600:00:00 JOB_QUEUE = JOB_MAIL = fangk@livemail.uthscsa.edu
#########################################################################
Data
#########################################################################
PAIR1_EXT = _R1 PAIR2_EXT = _R2
#######################################################################
Alignment options
#######################################################################
MIN_MAPQ = 10
BOWTIE2_IDX_PATH = /data/kun/Softwares/hiclib/hiclib-legacy/bin/bowtie2/index BOWTIE2_GLOBAL_OPTIONS = --very-sensitive -L 30 --score-min L,-0.6,-0.2 --end-to-end --reorder BOWTIE2_LOCAL_OPTIONS = --very-sensitive -L 20 --score-min L,-0.6,-0.2 --end-to-end --reorder
#######################################################################
Annotation files
#######################################################################
REFERENCE_GENOME = hg19 GENOME_SIZE = /data/kun/Jingwei/annotation/chrom_hg19.sizes
#######################################################################
Allele specific analysis
#######################################################################
ALLELE_SPECIFIC_SNP =
#######################################################################
Capture Hi-C analysis
#######################################################################
CAPTURE_TARGET = REPORT_CAPTURE_REPORTER = 1
#######################################################################
Digestion Hi-C
#######################################################################
GENOME_FRAGMENT = /data/kun/Jingwei/annotation/HindIII_resfrag_hg19.bed LIGATION_SITE = AAGCTAGCTT MIN_FRAG_SIZE = 100 MAX_FRAG_SIZE = 100000 MIN_INSERT_SIZE = 100 MAX_INSERT_SIZE = 800
#######################################################################
Hi-C processing
#######################################################################
MIN_CIS_DIST = GET_ALL_INTERACTION_CLASSES = 1 GET_PROCESS_SAM = 0 RM_SINGLETON = 1 RM_MULTI = 1 RM_DUP = 1
#######################################################################
Contact Maps
#######################################################################
BIN_SIZE = 20000 40000 150000 MATRIX_FORMAT = upper
#######################################################################
Normalization
####################################################################### MAX_ITER = 100 FILTER_LOW_COUNT_PERC = 0.02 FILTER_HIGH_COUNT_PERC = 0 EPS = 0.1
— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub <https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692285321 , or unsubscribe < https://github.com/notifications/unsubscribe-auth/AAOPQKOJAKIDM6TE7VWCHF3SFZZ2NANCNFSM4RLVVTPA
.
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub < https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692308254>, or unsubscribe < https://github.com/notifications/unsubscribe-auth/ADXLIA2CRZDF3A5GDWG65JTSFZ7GDANCNFSM4RLVVTPA .
— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-693486897, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAOPQKMMB245RUMWQGLCG5DSGDLMPANCNFSM4RLVVTPA .
Hi, Yaqiang
No problem at all. I am still get small number of loops of the sample I mentioned. But I checked the quality of the sample, the quality is not very good, I guess this is why we don’t get enough loop from it. Besides, I was wondering if there is anyway to speed up the program? Some of my samples have already processed 30h and not finished yet. Thanks!
Best, Kun
On Sep 17, 2020, at 10:54 AM, YaqiangCao notifications@github.com wrote:
Hi Kun, Thank you for the reminder. Sorry for the late reply, a little busy for other projects these days. I may update the script later. May I ask if you loop-calling goes well for your data with cLoops? Best, Yaqiang
On Wed, Sep 16, 2020 at 11:35 AM KunFang notifications@github.com wrote:
Hi, Yaqiang
I just got the reply from the author of HiC-Pro. The position of allValidPairs now changed back to 5’ end in the latest version ( https://github.com/nservant/HiC-Pro/issues/362).
Best, Kun
On Sep 14, 2020, at 3:53 PM, YaqiangCao notifications@github.com wrote:
Hi Kun, Could you please share me a small chromosome data (such as chr21) for HiC-Pro output or converted BEDPE file. I will check the data. Best, Yaqiang
On Mon, Sep 14, 2020 at 4:08 PM KunFang notifications@github.com wrote:
Hi, Yaqiang
Sorry for bothering but when I am trying to call hic loops of gsm1631184, I only get 2 loops. The hic data is processed by hicpro with the attached config file and then use the new hicpropairs2bedpe. I was wondering if there is any way to debug where goes wrong? Thanks!
Best, Kun
On Sep 14, 2020, at 10:00 AM, YaqiangCao notifications@github.com wrote:
Hi, Yaqiang
I found that hicpro2bedpe is not working properly as the code misunderstands the meaning of 11th and 12th col. They are not read length but the mapping quality. Please check the this website: https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY < https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY>.
Best, Kun
Hi Kun, Thank you very much for the reminding. Sorry for the carelessness. I have fix the problem, please refer to
https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe <
https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe
for updated code (line 17-18). Please let me kown if you have found other problems. Have a nice day. Thank you again.
Best, Yaqiang
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub < https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692115131 , or unsubscribe <
https://github.com/notifications/unsubscribe-auth/ADXLIA3HMMOTIFWXWLMYCWLSFYVZFANCNFSM4RLVVTPA
.
Please change the variable settings below if necessary
#########################################################################
Paths and Settings - Do not edit !
#########################################################################
TMP_DIR = tmp LOGS_DIR = log BOWTIE2_OUTPUT_DIR = bowtie_results MAPC_OUTPUT = hic_results RAW_DIR = rawdata
#######################################################################
SYSTEM AND SCHEDULER - Start Editing Here !!
####################################################################### N_CPU = 25 LOGFILE = hicpro.log
JOB_NAME = Jingwei_HiC JOB_MEM = 20gb JOB_WALLTIME = 600:00:00 JOB_QUEUE = JOB_MAIL = fangk@livemail.uthscsa.edu
#########################################################################
Data
#########################################################################
PAIR1_EXT = _R1 PAIR2_EXT = _R2
#######################################################################
Alignment options
#######################################################################
MIN_MAPQ = 10
BOWTIE2_IDX_PATH = /data/kun/Softwares/hiclib/hiclib-legacy/bin/bowtie2/index BOWTIE2_GLOBAL_OPTIONS = --very-sensitive -L 30 --score-min L,-0.6,-0.2 --end-to-end --reorder BOWTIE2_LOCAL_OPTIONS = --very-sensitive -L 20 --score-min L,-0.6,-0.2 --end-to-end --reorder
#######################################################################
Annotation files
#######################################################################
REFERENCE_GENOME = hg19 GENOME_SIZE = /data/kun/Jingwei/annotation/chrom_hg19.sizes
#######################################################################
Allele specific analysis
#######################################################################
ALLELE_SPECIFIC_SNP =
#######################################################################
Capture Hi-C analysis
#######################################################################
CAPTURE_TARGET = REPORT_CAPTURE_REPORTER = 1
#######################################################################
Digestion Hi-C
#######################################################################
GENOME_FRAGMENT = /data/kun/Jingwei/annotation/HindIII_resfrag_hg19.bed LIGATION_SITE = AAGCTAGCTT MIN_FRAG_SIZE = 100 MAX_FRAG_SIZE = 100000 MIN_INSERT_SIZE = 100 MAX_INSERT_SIZE = 800
#######################################################################
Hi-C processing
#######################################################################
MIN_CIS_DIST = GET_ALL_INTERACTION_CLASSES = 1 GET_PROCESS_SAM = 0 RM_SINGLETON = 1 RM_MULTI = 1 RM_DUP = 1
#######################################################################
Contact Maps
#######################################################################
BIN_SIZE = 20000 40000 150000 MATRIX_FORMAT = upper
#######################################################################
Normalization
####################################################################### MAX_ITER = 100 FILTER_LOW_COUNT_PERC = 0.02 FILTER_HIGH_COUNT_PERC = 0 EPS = 0.1
— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub <https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692285321 , or unsubscribe < https://github.com/notifications/unsubscribe-auth/AAOPQKOJAKIDM6TE7VWCHF3SFZZ2NANCNFSM4RLVVTPA
.
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub < https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692308254>, or unsubscribe < https://github.com/notifications/unsubscribe-auth/ADXLIA2CRZDF3A5GDWG65JTSFZ7GDANCNFSM4RLVVTPA .
— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-693486897, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAOPQKMMB245RUMWQGLCG5DSGDLMPANCNFSM4RLVVTPA .
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-694328463, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADXLIAZWHKLGGZGMDKWELHLSGIWMNANCNFSM4RLVVTPA.
Hi Kun, Thank you for the updates. Regarding the speed, I may not have better solution because from the bottom, cLoops is built up on Python. Following two tricks may work: 1) If you have enough CPUs and memories, -p set to chromosome numbers. 2) with -cut option, filter some close PETs, for Hi-C, for example, 5kb or 10kb. In my experience, run with -p 1 and for about 200 million PETs, it will take about 30-60 hours. Best, Yaqiang
On Thu, Sep 17, 2020 at 12:05 PM KunFang notifications@github.com wrote:
Hi, Yaqiang
No problem at all. I am still get small number of loops of the sample I mentioned. But I checked the quality of the sample, the quality is not very good, I guess this is why we don’t get enough loop from it. Besides, I was wondering if there is anyway to speed up the program? Some of my samples have already processed 30h and not finished yet. Thanks!
Best, Kun
On Sep 17, 2020, at 10:54 AM, YaqiangCao notifications@github.com wrote:
Hi Kun, Thank you for the reminder. Sorry for the late reply, a little busy for other projects these days. I may update the script later. May I ask if you loop-calling goes well for your data with cLoops? Best, Yaqiang
On Wed, Sep 16, 2020 at 11:35 AM KunFang notifications@github.com wrote:
Hi, Yaqiang
I just got the reply from the author of HiC-Pro. The position of allValidPairs now changed back to 5’ end in the latest version ( https://github.com/nservant/HiC-Pro/issues/362).
Best, Kun
On Sep 14, 2020, at 3:53 PM, YaqiangCao notifications@github.com wrote:
Hi Kun, Could you please share me a small chromosome data (such as chr21) for HiC-Pro output or converted BEDPE file. I will check the data. Best, Yaqiang
On Mon, Sep 14, 2020 at 4:08 PM KunFang notifications@github.com wrote:
Hi, Yaqiang
Sorry for bothering but when I am trying to call hic loops of gsm1631184, I only get 2 loops. The hic data is processed by hicpro with the attached config file and then use the new hicpropairs2bedpe. I was wondering if there is any way to debug where goes wrong? Thanks!
Best, Kun
On Sep 14, 2020, at 10:00 AM, YaqiangCao < notifications@github.com> wrote:
Hi, Yaqiang
I found that hicpro2bedpe is not working properly as the code misunderstands the meaning of 11th and 12th col. They are not read length but the mapping quality. Please check the this website: https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY < https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY>.
Best, Kun
Hi Kun, Thank you very much for the reminding. Sorry for the carelessness. I have fix the problem, please refer to
https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe
<
https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe
for updated code (line 17-18). Please let me kown if you have found other problems. Have a nice day. Thank you again.
Best, Yaqiang
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <
https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692115131 , or unsubscribe <
https://github.com/notifications/unsubscribe-auth/ADXLIA3HMMOTIFWXWLMYCWLSFYVZFANCNFSM4RLVVTPA
.
Please change the variable settings below if necessary
#########################################################################
Paths and Settings - Do not edit !
#########################################################################
TMP_DIR = tmp LOGS_DIR = log BOWTIE2_OUTPUT_DIR = bowtie_results MAPC_OUTPUT = hic_results RAW_DIR = rawdata
#######################################################################
SYSTEM AND SCHEDULER - Start Editing Here !!
####################################################################### N_CPU = 25 LOGFILE = hicpro.log
JOB_NAME = Jingwei_HiC JOB_MEM = 20gb JOB_WALLTIME = 600:00:00 JOB_QUEUE = JOB_MAIL = fangk@livemail.uthscsa.edu
#########################################################################
Data
#########################################################################
PAIR1_EXT = _R1 PAIR2_EXT = _R2
#######################################################################
Alignment options
#######################################################################
MIN_MAPQ = 10
BOWTIE2_IDX_PATH = /data/kun/Softwares/hiclib/hiclib-legacy/bin/bowtie2/index BOWTIE2_GLOBAL_OPTIONS = --very-sensitive -L 30 --score-min L,-0.6,-0.2 --end-to-end --reorder BOWTIE2_LOCAL_OPTIONS = --very-sensitive -L 20 --score-min L,-0.6,-0.2 --end-to-end --reorder
#######################################################################
Annotation files
#######################################################################
REFERENCE_GENOME = hg19 GENOME_SIZE = /data/kun/Jingwei/annotation/chrom_hg19.sizes
#######################################################################
Allele specific analysis
#######################################################################
ALLELE_SPECIFIC_SNP =
#######################################################################
Capture Hi-C analysis
#######################################################################
CAPTURE_TARGET = REPORT_CAPTURE_REPORTER = 1
#######################################################################
Digestion Hi-C
#######################################################################
GENOME_FRAGMENT = /data/kun/Jingwei/annotation/HindIII_resfrag_hg19.bed LIGATION_SITE = AAGCTAGCTT MIN_FRAG_SIZE = 100 MAX_FRAG_SIZE = 100000 MIN_INSERT_SIZE = 100 MAX_INSERT_SIZE = 800
#######################################################################
Hi-C processing
#######################################################################
MIN_CIS_DIST = GET_ALL_INTERACTION_CLASSES = 1 GET_PROCESS_SAM = 0 RM_SINGLETON = 1 RM_MULTI = 1 RM_DUP = 1
#######################################################################
Contact Maps
#######################################################################
BIN_SIZE = 20000 40000 150000 MATRIX_FORMAT = upper
#######################################################################
Normalization
####################################################################### MAX_ITER = 100 FILTER_LOW_COUNT_PERC = 0.02 FILTER_HIGH_COUNT_PERC = 0 EPS = 0.1
— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub < https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692285321 , or unsubscribe <
https://github.com/notifications/unsubscribe-auth/AAOPQKOJAKIDM6TE7VWCHF3SFZZ2NANCNFSM4RLVVTPA
.
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub < https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692308254 , or unsubscribe <
https://github.com/notifications/unsubscribe-auth/ADXLIA2CRZDF3A5GDWG65JTSFZ7GDANCNFSM4RLVVTPA
.
— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub <https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-693486897 , or unsubscribe < https://github.com/notifications/unsubscribe-auth/AAOPQKMMB245RUMWQGLCG5DSGDLMPANCNFSM4RLVVTPA
.
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub < https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-694328463>, or unsubscribe < https://github.com/notifications/unsubscribe-auth/ADXLIAZWHKLGGZGMDKWELHLSGIWMNANCNFSM4RLVVTPA .
— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-694334867, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAOPQKKZYIQRIT7B36BMCNDSGIXVRANCNFSM4RLVVTPA .
Hi, Yaqiang
Thanks for your info! Understand, I just a little bit worried the program was stuck before as I am not sure how long the processing will take. Since your experience is within 30-60h, I might just wait for it.
Best, Kun
On Sep 17, 2020, at 11:11 AM, YaqiangCao notifications@github.com wrote:
Hi Kun, Thank you for the updates. Regarding the speed, I may not have better solution because from the bottom, cLoops is built up on Python. Following two tricks may work: 1) If you have enough CPUs and memories, -p set to chromosome numbers. 2) with -cut option, filter some close PETs, for Hi-C, for example, 5kb or 10kb. In my experience, run with -p 1 and for about 200 million PETs, it will take about 30-60 hours. Best, Yaqiang
On Thu, Sep 17, 2020 at 12:05 PM KunFang notifications@github.com wrote:
Hi, Yaqiang
No problem at all. I am still get small number of loops of the sample I mentioned. But I checked the quality of the sample, the quality is not very good, I guess this is why we don’t get enough loop from it. Besides, I was wondering if there is anyway to speed up the program? Some of my samples have already processed 30h and not finished yet. Thanks!
Best, Kun
On Sep 17, 2020, at 10:54 AM, YaqiangCao notifications@github.com wrote:
Hi Kun, Thank you for the reminder. Sorry for the late reply, a little busy for other projects these days. I may update the script later. May I ask if you loop-calling goes well for your data with cLoops? Best, Yaqiang
On Wed, Sep 16, 2020 at 11:35 AM KunFang notifications@github.com wrote:
Hi, Yaqiang
I just got the reply from the author of HiC-Pro. The position of allValidPairs now changed back to 5’ end in the latest version ( https://github.com/nservant/HiC-Pro/issues/362).
Best, Kun
On Sep 14, 2020, at 3:53 PM, YaqiangCao notifications@github.com wrote:
Hi Kun, Could you please share me a small chromosome data (such as chr21) for HiC-Pro output or converted BEDPE file. I will check the data. Best, Yaqiang
On Mon, Sep 14, 2020 at 4:08 PM KunFang notifications@github.com wrote:
Hi, Yaqiang
Sorry for bothering but when I am trying to call hic loops of gsm1631184, I only get 2 loops. The hic data is processed by hicpro with the attached config file and then use the new hicpropairs2bedpe. I was wondering if there is any way to debug where goes wrong? Thanks!
Best, Kun
On Sep 14, 2020, at 10:00 AM, YaqiangCao < notifications@github.com> wrote:
Hi, Yaqiang
I found that hicpro2bedpe is not working properly as the code misunderstands the meaning of 11th and 12th col. They are not read length but the mapping quality. Please check the this website: https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY < https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY>.
Best, Kun
Hi Kun, Thank you very much for the reminding. Sorry for the carelessness. I have fix the problem, please refer to
https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe
<
https://github.com/YaqiangCao/cLoops/blob/master/scripts/hicpropairs2bedpe
for updated code (line 17-18). Please let me kown if you have found other problems. Have a nice day. Thank you again.
Best, Yaqiang
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <
https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692115131 , or unsubscribe <
https://github.com/notifications/unsubscribe-auth/ADXLIA3HMMOTIFWXWLMYCWLSFYVZFANCNFSM4RLVVTPA
.
Please change the variable settings below if necessary
#########################################################################
Paths and Settings - Do not edit !
#########################################################################
TMP_DIR = tmp LOGS_DIR = log BOWTIE2_OUTPUT_DIR = bowtie_results MAPC_OUTPUT = hic_results RAW_DIR = rawdata
#######################################################################
SYSTEM AND SCHEDULER - Start Editing Here !!
####################################################################### N_CPU = 25 LOGFILE = hicpro.log
JOB_NAME = Jingwei_HiC JOB_MEM = 20gb JOB_WALLTIME = 600:00:00 JOB_QUEUE = JOB_MAIL = fangk@livemail.uthscsa.edu
#########################################################################
Data
#########################################################################
PAIR1_EXT = _R1 PAIR2_EXT = _R2
#######################################################################
Alignment options
#######################################################################
MIN_MAPQ = 10
BOWTIE2_IDX_PATH = /data/kun/Softwares/hiclib/hiclib-legacy/bin/bowtie2/index BOWTIE2_GLOBAL_OPTIONS = --very-sensitive -L 30 --score-min L,-0.6,-0.2 --end-to-end --reorder BOWTIE2_LOCAL_OPTIONS = --very-sensitive -L 20 --score-min L,-0.6,-0.2 --end-to-end --reorder
#######################################################################
Annotation files
#######################################################################
REFERENCE_GENOME = hg19 GENOME_SIZE = /data/kun/Jingwei/annotation/chrom_hg19.sizes
#######################################################################
Allele specific analysis
#######################################################################
ALLELE_SPECIFIC_SNP =
#######################################################################
Capture Hi-C analysis
#######################################################################
CAPTURE_TARGET = REPORT_CAPTURE_REPORTER = 1
#######################################################################
Digestion Hi-C
#######################################################################
GENOME_FRAGMENT = /data/kun/Jingwei/annotation/HindIII_resfrag_hg19.bed LIGATION_SITE = AAGCTAGCTT MIN_FRAG_SIZE = 100 MAX_FRAG_SIZE = 100000 MIN_INSERT_SIZE = 100 MAX_INSERT_SIZE = 800
#######################################################################
Hi-C processing
#######################################################################
MIN_CIS_DIST = GET_ALL_INTERACTION_CLASSES = 1 GET_PROCESS_SAM = 0 RM_SINGLETON = 1 RM_MULTI = 1 RM_DUP = 1
#######################################################################
Contact Maps
#######################################################################
BIN_SIZE = 20000 40000 150000 MATRIX_FORMAT = upper
#######################################################################
Normalization
####################################################################### MAX_ITER = 100 FILTER_LOW_COUNT_PERC = 0.02 FILTER_HIGH_COUNT_PERC = 0 EPS = 0.1
— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub < https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692285321 , or unsubscribe <
https://github.com/notifications/unsubscribe-auth/AAOPQKOJAKIDM6TE7VWCHF3SFZZ2NANCNFSM4RLVVTPA
.
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub < https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-692308254 , or unsubscribe <
https://github.com/notifications/unsubscribe-auth/ADXLIA2CRZDF3A5GDWG65JTSFZ7GDANCNFSM4RLVVTPA
.
— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub <https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-693486897 , or unsubscribe < https://github.com/notifications/unsubscribe-auth/AAOPQKMMB245RUMWQGLCG5DSGDLMPANCNFSM4RLVVTPA
.
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub < https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-694328463>, or unsubscribe < https://github.com/notifications/unsubscribe-auth/ADXLIAZWHKLGGZGMDKWELHLSGIWMNANCNFSM4RLVVTPA .
— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-694334867, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAOPQKKZYIQRIT7B36BMCNDSGIXVRANCNFSM4RLVVTPA .
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/YaqiangCao/cLoops/issues/23#issuecomment-694338304, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADXLIA5Y3NQ6ANJLV3GTKCTSGIYM3ANCNFSM4RLVVTPA.
Hi, Yaqiang
I found that hicpro2bedpe is not working properly as the code misunderstands the meaning of 11th and 12th col. They are not read length but the mapping quality. Please check the this website:https://groups.google.com/g/hic-pro/c/qhHNbQVJfYY.
Best, Kun