Open lxwgcool opened 3 years ago
(1) Add the case of "capture kit = "WGS "" (2) Print more info in log file to make it looks better. (3) Add multiple thread feature for samtools
(1) Load samtools 1.13 instead of 1.8 to support the feature of multuple threads. (2) Add "WGS_BED" and "WGS_TOTAL_BASES" a) Homo_sapiens_assembly38.bed
(1) Move the part of "calling samtools" inside if section of GATK, since the samtools need to be called only if GATK need to be called. (2) Change the way to calcualte BASES_Q_AVE (use the same method as the case of WES) (3) Change the way to collect "MEAN_INSERT_SIZES" a) use column 6 instead of column 5
(1) send keytable name as an argument (2) append keytable name to coverage report
(1) get input arguments at the beginning (2) Append keytable name in the the log file dir (coverage) (3) Correct the caption field of the coverage report. (4) Use 8 cores to submit jobs
(1) send keytable name as an argument (2) append keytable name to coverage report
(1) get input arguments at the beginning (2) Append keytable name in the the log file dir (pre-calling) (3) Correct the caption field of the pre-calling QC report. (4) Use 8 cores to submit jobs
(1) Load samtools 1.13 instead of 1.8 to support the feature of multuple threads.
(1) Print more info in the log file of "MergeBAM" (2) Change the way to call "CoverageReport" 1) Additional argument (2) Change the way to call "PreCallingQCReport" 1) Additional argument (3) Change the way to find pre-calling QC report 1) Use the pattern "strKTName"(keytable name) (4) Corrected the of printing report location
(1) Add "-L" to find soft-link (2) Set strASSAYID by using "EZ_WGS_PE"
I have changed the code and use “Homo_sapiens_assembly38.bed” for calculation. As a result 2 different bed files will be used for our COVID project
CDS Reference: /data/COVID_WGS/lix33/DCEG/CGF/Bioinformatics/Production/data/CDS/v38/BedFileForRef38_CCDS.MergedOverlap.Brief.bed
Normal Capture kit: /data/COVID_WGS/lix33/DCEG/CGF/Bioinformatics/Production/data/ref38/Homo_sapiens_assembly38.bed
Please ignore this column. I just checked the code, the original code is out of date: the matrix and caption fields are inconsistent. These %Merge Dup and % Merge Optical Dup are never be calculated.
I have updated the caption fields.
Same as before: the original code is out of date: the matrix and caption fields are inconsistent. I have updated the caption fields.
Yes, I find the logic of the calculation is incorrect in original code. I have updated the code.
Same as before: the original code is out of date: the matrix and caption fields are inconsistent. I have updated the caption fields.
Same as before: the original code is out of date: the matrix and caption fields are inconsistent. I have updated the caption fields.
I also added some parallel computing features in the code.
I have submit jobs to redo these 2 reports and will notify you once everything is all set.
After Kristie checked three types of report, we found some bugs.
We have fixed all these bugs and also added some new features into the code. For details, please check the comments below.