GuoliangLi-HZAU / BatMeth2

BS-seq analysis pipeline
38 stars 13 forks source link

[E::sam_parse1] incomplete aux field #29

Open a-pallav opened 2 years ago

a-pallav commented 2 years ago

Hi,

I am encountering "incomplete aux field" when I am looking to execute pipel subcommand. please see detailed error log here.

[MM] /production/BatMeth2-master/bin/memalign c2t -1 Bis-AG876_S3_R1_001.fastq.clean.gz -2 Bis-AG876_S3_R2_001.fastq.clean.gz -o Bis-AG876.type2 | /production/BatMeth2-master/bin/bwame mem -t 8 -C -p -Y /production/References/ViralIndexes/batMeth/type2/EBV_type_2_complete_genome_FASTA_GenBank_NC_009334.1.fa.batmeth2.fa - | samtools sort -@ 8 -o ./Bis-AG876.type2.sort.bam - Process paired-end reads! Process input file: Bis-AG876_S3_R1_001.fastq.clean.gz, Bis-AG876_S3_R2_001.fastq.clean.gz [M::mem_align] read 0 ALT contigs [M::process] read 91692 sequences (10874058 bp)... [M::process] 0 single-end sequences; 91692 paired-end sequences [M::mem_pestat] # candidate unique pairs for (FF, FR, RF, RR): (1, 38061, 1, 0) [M::mem_pestat] skip orientation FF as there are not enough pairs [M::mem_pestat] analyzing insert size distribution for orientation FR... [M::mem_pestat] (25, 50, 75) percentile: (104, 129, 164) [M::mem_pestat] low and high boundaries for computing mean and std.dev: (1, 284) [M::mem_pestat] mean and std.dev: (137.14, 44.39) [M::mem_pestat] low and high boundaries for proper pairs: (1, 344) [M::mem_pestat] skip orientation RF as there are not enough pairs [M::mem_pestat] skip orientation RR as there are not enough pairs [M::mem_process_seqs] Processed 91692 reads in 6.221 CPU sec, 0.785 real sec [E::sam_parse1] incomplete aux field samtools sort: truncated file. Aborting

ZhouQiangwei commented 2 years ago

Hi, could u please run the command below and check whether the sam file 'Bis-AG876.type2.sam' is correct?

/production/BatMeth2-master/bin/memalign c2t -1 Bis-AG876_S3_R1_001.fastq.clean.gz -2 Bis-AG876_S3_R2_001.fastq.clean.gz -o Bis-AG876.type2 | /production/BatMeth2-master/bin/bwame mem -t 8 -C -p -Y /production/References/ViralIndexes/batMeth/type2/EBV_type_2_complete_genome_FASTA_GenBank_NC_009334.1.fa.batmeth2.fa - > Bis-AG876.type2.sam

a-pallav commented 2 years ago

Hi, could u please run the command below and check whether the sam file 'Bis-AG876.type2.sam' is correct?

/production/BatMeth2-master/bin/memalign c2t -1 Bis-AG876_S3_R1_001.fastq.clean.gz -2 Bis-AG876_S3_R2_001.fastq.clean.gz -o Bis-AG876.type2 | /production/BatMeth2-master/bin/bwame mem -t 8 -C -p -Y /production/References/ViralIndexes/batMeth/type2/EBV_type_2_complete_genome_FASTA_GenBank_NC_009334.1.fa.batmeth2.fa - > Bis-AG876.type2.sam

Hi, Yes, above command works like a charm. with 35MB sam file.

a-pallav commented 2 years ago

Even this is working: /production/BatMeth2-master/bin/memalign c2t -1 Bis-AG876_S3_R1_001.fastq.clean.gz -2 Bis-AG876_S3_R2_001.fastq.clean.gz -o Bis-AG876.type2 | /production/BatMeth2-master/bin/bwame mem -t 8 -C -p -Y /production/References/ViralIndexes/batMeth/type2/EBV_type_2_complete_genome_FASTA_GenBank_NC_009334.1.fa.batmeth2.fa - | samtools sort -o ./Bis-AG876.type2.batmeth2.sorted.bam

I think there is an issue with samtools threads usage using -@ flag.

ZhouQiangwei commented 2 years ago

Great. Maybe you can check the version of samtools on your server?

ZhouQiangwei commented 2 years ago

http://www.htslib.org/doc/samtools-sort.html the new version of samtools is support -@ threads paramater, could you please update the samtools in your server

a-pallav commented 2 years ago

I see. it is using system / usr/local/bin samtools. Commands are working after I updated system samtools. Thank you very much.

I have another question. Our sequences have good stretch of repeat elements. Do you have recommendations to set the thresholds for viral data?

ZhouQiangwei commented 2 years ago

In fact, the mapping and analysis of repeat regions has always been very difficult, especially for DNA methylation data. At present, there are no separate parameters for this part in mapping and methylation level calculation. You can also do some processing according to the situation. For example, merge repeat regions. I hope you can successfully perform the analysis.

a-pallav commented 2 years ago

Hi Guoliang,

Thank you very much for getting back. I have one more question. How do I run pipel command but with using bwa-meth as the aligner of choice? I figured the pipel command requires parmeter1/2 files that it creates automatically if I use bat-meth2 internal aligner.

Please advise,

aparna

From: momocoding @.> Date: Sunday, May 1, 2022 at 2:40 AM To: GuoliangLi-HZAU/BatMeth2 @.> Cc: Aparna Pallavajjala @.>, Author @.> Subject: Re: [GuoliangLi-HZAU/BatMeth2] [E::sam_parse1] incomplete aux field (Issue #29)

  External Email - Use Caution

In fact, the mapping and analysis of repeat regions has always been very difficult, especially for DNA methylation data. At present, there are no separate parameters for this part in mapping and methylation level calculation. You can also do some processing according to the situation. For example, merge repeat regions. I hope you can successfully perform the analysis.

— Reply to this email directly, view it on GitHubhttps://nam02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FGuoliangLi-HZAU%2FBatMeth2%2Fissues%2F29%23issuecomment-1114148733&data=05%7C01%7Capallav2%40jhmi.edu%7Cc5bde7361ab64d7d522a08da2b3d7224%7C9fa4f438b1e6473b803f86f8aedf0dec%7C0%7C0%7C637869840161284975%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=eJ8insaBZd1haRbxG6Wn6cB482xmLSPXeU3S4TBDWPY%3D&reserved=0, or unsubscribehttps://nam02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAY6IZI366HASURETNULGTCDVHYRMXANCNFSM5UVZ63LQ&data=05%7C01%7Capallav2%40jhmi.edu%7Cc5bde7361ab64d7d522a08da2b3d7224%7C9fa4f438b1e6473b803f86f8aedf0dec%7C0%7C0%7C637869840161284975%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=FtS%2B2qnKP9mX%2BKyqx3bHCPK9EAj0yslzmgIuOj4OISg%3D&reserved=0. You are receiving this because you authored the thread.Message ID: @.***>

ZhouQiangwei commented 2 years ago

hi, In the new version, we have removed the parameters of select aligner, and the BWA MEM align algorithm is used for alignment by default, so you can run bt2 directly. mo