Open lxwgcool opened 1 year ago
Data Root
/scratch/lix33/lxwg/Project/H_pylori/Processed
BAM
/scratch/lix33/lxwg/Project/H_pylori/Processed/BAM
Contains 3 different types of BAM, including Raw, Mapped (Unique) and Unmapped
Raw BAM /scratch/lix33/lxwg/Project/H_pylori/Processed/BAM/*/Raw
Mapped BAM (Unique) /scratch/lix33/lxwg/Project/H_pylori/Processed/BAM/*/Mapped
UnMapped BAM /scratch/lix33/lxwg/Project/H_pylori/Processed/BAM/*/UnMapped
* Variant Calling results
/scratch/lix33/lxwg/Project/H_pylori/Processed/CSV
* Log
/scratch/lix33/lxwg/Project/H_pylori/Processed/Log (1) The calling command line and related history are recorded here
CMD
samtools view -@ ${threads} -q 10 -F 4 -F 256 -h ${input.bam} | grep -v -E -e '\bXA:Z:' -e '\bSA:Z:' | samtools view -b - > ${output.bam}
Details
Reference
Command line
samtools view -h -b -f 4 ${input.bam} > ${output.bam}
Details
1: Easy case
(1) Normal, discored (1), normal
2: Midean Level case
(1) front goes to back (304)
* still pick it out)
3: Hard level case
(1) Number of reads from back to front
* not report all
* only find if there is discord in these number of reads
(2) order goes to reverse order
* just ignore it
(3) number of reads together but the order arrangment is pretty complex.
(4) the case of Chimeric
Hey Difei,
Everything is all set.
The results have been uploaded into:
/scratch/lix33/lxwg/Project/H_pylori/Processed/Methylation
Please check the files below:
The related github issue is:
https://github.com/NCI-CGR/IlluminaSequencingAnalysis/issues/57
The latest commit is:
https://github.com/NCI-CGR/IlluminaSequencingAnalysis/commit/f49dc3cf0531e8477567029643b34bcd2a9d0d08
The column “LiftoverAlignedPosRef(Methylation)” in CSV file is what you are looking for.
I am happy this job could be done in 2022.
Have a great new year
Best
Xin
1: Explain CIGAR Mask https://samtools.sourceforge.net/samtools/bam/PDefines/PDefines.html
2: How to do offset for different CIGAR code https://sourceforge.net/p/samtools/mailman/message/29373646/
Orphan Methylation
Set the position to its nearest reference
Add mask "(*)"
The related reference base will be set as "*"
Three different purposes