philres / ngmlr

NGMLR is a long-read mapper designed to align PacBio or Oxford Nanopore (standard and ultra-long) to a reference genome with a focus on reads that span structural variations
MIT License
293 stars 40 forks source link

Parse error at line #75

Open marcotoffoli opened 4 years ago

marcotoffoli commented 4 years ago

Good morning,

I'm having an issue with alignment and subsequent sorting of files. My pipeline is: sequencing with MinION (Nanopore), basecalling and demultiplexing with Guppy, alignment with NGMLR, sorting with samtools. I'm using the latest version of all software.

The command for NGMLR is: ngmlr -t 8 -r human_hg19_mod.fa -q file.fastq -o output.bam -x ont

With this, I get: Done (3591 reads mapped (7.32%), 45467 reads not mapped, 49449 lines written)(elapsed: 1m, 51 r/s)

Then I run: samtools sort output.bam > sorted.bam

and get: [W::sam_read1] Parse error at line 10440 samtools sort: truncated file. Aborting

So my questions are: why is NGMLR mapping only 7.32% of reads? What is the problem with samtools sort?

Thank you, Marco

marcotoffoli commented 4 years ago

Just a quick update: I went back to samtools version 1.9 (i was previously using 1.10) And the sorting worked without a problem.

Any idea why this is happening?

wdecoster commented 4 years ago

Does ngmlr actually write in BAM format? Is output.bam maybe a SAM file? Also, I would suggest you to use samtools sort output.bam -o sorted.bam to make sure the output format is bam.

marcotoffoli commented 4 years ago

Hi wdecoster,

From NGMLR help: -o <string>, --output <string> Adds RG:Z:<string> to all alignments in SAM/BAM

So I'm guessing NGMLR can output a .bam file?

Also, I used the very sam epipeline a lot in the past and didn't get any error. The parse error appeared only after upgrading to samtools 1.10. After going back to samtools 1.9 the error disappeared.

So I'm guessing samtools 1.10 doesn't like the output from NGMLR?

wdecoster commented 4 years ago

That NGMLR help string is actually a mistake, and what you just pasted is the help of --rg-id, but that's something that's wrong in the code and should not be related to this issue.

I don't know if you can write a bam, and perhaps you should just check with the unix file command: file output.bam and see what you get. SAM is text, BAM would return you "gzip compressed data, extra field". Here are the changes to samtools in 1.10: https://github.com/samtools/samtools/releases

marcotoffoli commented 4 years ago

Dear wdecoster,

You are actually right, the output from NGMLR is a SAM file, even though I called it .bam. I just realigned with NGMLR using .sam for output, but when I then try to run samtools view: samtools view -b file.sam -o file.bam

I get the error : [W::sam_read1] Parse error at line 32273 [main_samview] truncated file.

Any idea?

wdecoster commented 4 years ago

I would try to take a look at that line, and see if I can spot what's wrong. Is it the last line of the file?

marcotoffoli commented 4 years ago

If I try to run the same command samtools view -b file.sam -o file.bam

with samtools 1.9, with the same input file, it works without returning the parse error.

marcotoffoli commented 4 years ago

The offending line is: a41c1f02-4a89-4bd3-b44b-68b3a71f10d9 2048 1 155188772 -2147483648 8705S14M1I5M1D65M1D6M1I4M2D4M1D29M1I1M1D1M1D1M1I35M1I7M1D1M1D15M48S TACTTGTACTTCATTCAGTTACGTATTGCTGGTGCTGAAAGTTGTCGGTGTCTTTGTGTTAACCTACTTGCCTGTCGCTCTATCTTCCAACCTTTCTTCCTTCTTCTCAATGGGCCCAGGACCTTGACGGCCAGCAGCGGGCTCCAAGGCCCAGGCTTTTACCAACATGACAGGCTACTGGCTGGGCCCAGGCAAGGGGCCTTGGCAGGAAAGTTCCTTGCTGTACCTCCACTGCACTCAGAGGCCAGTGAGGGGGTGCCAAGACAGGACTCCTTCCTCCCCTGGTGAAGTGCCCTTGCAGCTCCCCAGCGTCCACGCCATGGATATTTCCTCCACAGAAGTACAATGCTGATTATGATCTGTCAGCTCAGCAAGGGCAGACACCCTGGCCTTCATGTCTCTCCTGGAGGAGAAGTTGATCCCAGTGCTGGTGAGTGTGCCCAGACCTCCCAGCATCCATGGCCAGCCAGGGAGGGACGGGCACACACAGACCCACAGAGACTCAGGGAGGCATGGAGGTCAGAAGCCCACCTTGAATCAGACAGGTGCACTGGCTCAGACCTGCCTGTTCCTTCCTGCCCACCCAATCCAGGTACATACTTTTTGGATGAGCACCAAGAACTACGTAGAAGTGACCCGGAAGTGGTATGCAGAGGCTATGCCCTTTCCCTCAGCTTCTTCCTGCCTGGCCGCATGCAGCGACAGTGCATGGAACGGCTACAGCTGCTGACTAGAGACTTTGACTGAGATGAGAAGAGTTGGAGAAGGAGGTAGCTCTGAGACCGGGGCTATTGTATGAGATTGAGCCAAGGATGCTGGCCAGGAATGGGAGTGCTTAGAGTGCAGAGGTGGCACATTCACAGCTACCAAAACCTACCTGTGTCGCCCCTACAGCTGTACCAAGAGGCTCAGGAGTGTCTGACCCTGCTCTCTCAGCGCCTGGGCTCTCAAAAAAGTTCTTCTTTGGAGATGCGTGAGTCTGACTCCAGAAGGTGCAATGGGTGGCTTGGAAGAAGATGCAGGTTCAGATGGAGCAGCTGGAGCTGGGGCTGGGGCTGGGGCTGGCTCAGGCTCTGGATAGGAGGTCCCTGAGACAGATACTGGCTGGTGACAGTGGGGCGTGGGGCCAGAGCCTTCTCAGGAGTACAAGGGGTAGGGTGGGAGGGCAGCCAGGCACAGGAAGGGCCTGAAAGAGCTGTGGGGCACTGAGTGTGCCCTTTATGCAGCCCTGGGATGAGCCCTATTCAGGGCCAGGCTGGCAGCACCACCTGGGGATCTCTCCCCATACCAGGTCTAGGACTGTGTGTCCTGTCCTTCCCTAAGTGGCCGCCTGCTGCCCAGAGCCCACCTCCCAAGGCTGACTCTTCCTCCGGCTCCATCTTTACCCCTTCTACCCCAGTGGTTCTCCTCCATCCCACCCTTCTCTCTCTGCTCCAGCCACCTGCCTCCTTGGACACCTTCACGTCTTCAGCTACTTGGCCCTGCTGCTGCAGGCAAAGCTGCCCAGTGGGAGCTGCAAGTCCACCTGCGTGGGCTGCACAACCTCTGTGCCTATTGTACCCACATTCTCAGTCTCTACTTCCCCTGGGATGGAGGTAAGGGGCAGATGGGAGGGGCAGCCCTGGGGGAGTGGGCAGGGATCAAGAACTAGTTCTCCTAACACACCTTCCTTCCTTGACCCTCAGCTGAGGTACCACCGCAGCACTTGAACACCAGCAGGCCCAGAGACTGAGGAGGAATATACCGGCGCCGGAACCAGGTCCTACCTGTGCTGGCAGGTTTGGCAGCCATGGTGGGCTACGCCTTGCTCAGCGGCATTGTCTCCATCCAGCGGGCAACGCCTGCTCCGGGCCCAGGCACCGGACCCTGGGCATGGCTGAGGAGGATGAGAGATGATTTGTCCTCACGCTCCCAAGACTGGTTTTCTACTCTCATGCATTCCAGAGGCCCCCGTGCCTCCTCGTTATTGGTACAGCCGGACGCTGCGGTGCTGCCACCCAGAATAAAGCCACTCACATGACTGGGCTCAAACATTTTCTCCTTTAAGAGCTGCCATTTTCCTGGCTGGTGCCATAGGAATCATCTGGGTGCCTGGGCACACCCGCTGCTGCTTTAAGGCTTCCGCCCTGATGCTGACACTGCTGCTCCACGGGCCCAGTTCTGCATCTCCAGGAAAGACAAACAGTCTCCAGTTTGGGCCCAGCTTTCCTAGTCTCTTCTTTCCTTACCCTCAGCCCTGATCTTGTGTTTGTACAGGACAGTGAGCTCACCCTAGGCCTGGACCCAGGCCCAGTTTCTAAAGCAAGCAGCACACAGCGCATGTTCACATAAGCATGGGGCTGGGGGACACTGGGGCTTACTGATCTTTTTCTAGGCCTCCCAGCCTATGGCACCACCTAGAGGAGAAGTGAGTCACCCAAACCATTGCCCCCTGGGCTTACGTCGCTGTAAGCTCACACTGGCCCTGCTGTGCCCTCTTTAGTCACAGACAGCGTGTGAGCTGACTCTGTCCTTTAATGCCCAGGCTGAGCCCAGTGCCTCCTTGGTATCTGCTCTATCACTGGCGACGCCACAGGCCAGGTGTGAATGGAGTAGCCAGGTGAGATTGTCTCAGGAAGCCCACAGCAGGATCCTTGATGGTAAGAGGCACATCCTTAGAGGCTAGGGAGCAGGGAGGAGAAGCTGAGAGTGTGATCCTGCCAAGGCCCCCAACGCTGTCTTCAGCCCACTTCCCCAGACCTCACCATTGCCCTCACCGGTTTAGCACGACCACAACAGCAGAGCCATCGGGATGCATCAGTGCCACTGCGTCCAGGTCGTTCTTCTGACTGGCAACCAGCCCCACTCTCTGGGAGCCCTCAGGAATGAACTTGCTGAATAACAGAGACAGATGGTCAGAGTCCCCTCAGGGTACCTCCATGAAACCCTCATCTAAGAAGTCACCCACCCTGGACCCACCCCATAACTCCTGCAGAGGCTCTGCCCTGGCTCTCTAGGCCTGGAGCCATGCTGCTGGGCACTGACCCTGCTTTTCTGCATCGCAGTCCAGCCTCAGGCATTGGGGTTTTCTGTTGCTACCTAGTCACTTCCCACTGCCTCCATGGCGCAAAAGGGGATGGGTGTGCTCTTCGAGGTTCCACCTTGAACACCTTCCTGCTCCCTCGTGGTGTAGAGTGATGTAAGCCACCCGATGTGGGGATGATAGGCCTGGTATGGAATGGGGTGCCGCCCTCCACTCACCTGAAGTGGCCAAGGTGGTAGAACATGGGCTGTTTGTAAAACGTGCCCTTGGTGATGTCTACAATGATGGGACTGTCGACAAAAGTTACGCACCCAATTGGGTCCTCCTTCGGGGTTCAGGGCAAGGTTCCAGTCGGTCCAGCCCGACCACATGGTACAGGAGGTTCTGAGGTAAGGACAAGGCAAAGAGACAAGGCTCAACACTGGGGGTCCCCAGAGAGTGTAGGTAAGGGTCACATGTGGGAGAGGCAGCTGTGGGTAGGTCAGCCCTGTGAGGGCACATTCCTTAGTAGCTAAGGAGTTGGGGGTGTGAAGATCCAGGCATCTCAAGGGGAGCTGAAGTCTGAGGCAGCTGCAAGTGCCTCAGTAGTTGCAAAGGGGCAATGAGGTGTGCAGACCTGTGAAGGGAAAGGGAAGATAGGGAATCCATGGTTCCCCAGAGTTGCTCAAAAGGGCAGGCTAGCTGGGAAGCTGGACAGGAAGGGCTTCTGTCAGTCTTTGGTGAAACTAGTAAGAGGTCTGAGGTCTGCTTTGCAGGAAGGGAGACTGGGGTGGCTTACCGTGATGATGCTGTGGCTGTACTGCATCCCTCGATCCCAGGAGCTAGCCGCACACTCTGCTCCCAGAACTTGGAGCCCACACACAGGCCTCTGAGGCAAGAGCATGTGTTGGGGAACAGGCGGTATTGTGCTCCCCTAGGGTGTGGCTTTGGCTGGAGCCAGAAAGTCCAGGTACCAATGTACAGCAATGCCATGAACATATTTAGCTGCTTCTGGGTCTGTCGGTACCTGCAAGGAAGAGCAGCGATCCTGGACCTTGCACACAGGCTTCTGGAACTTCTAGTTCCTGTTGTAGGAATCCTGGAGGTGGGTGACGGGAAGAATGCAGCTAGAGAGGTTTGGGGAGATTTTTTTGTTTTTGAGACAGGATGTGTCACTCTGTTCGGGCTGGAGTACAGTGGCGCAATCACGACTCACTGCAGCCTTGACTTCCTAGGGTCAACTGATCCTCCCACATTAGCCTCCTGAGTAGCTGGGACTACACGGGTGCCACACCCAGCTAATTTGTGTGTGTATGTGTGTGTATGTATGTGTGTGTGTGTATATATATATACATATAAACACATATATATGTATATACACATACAGCCATGAACCACCACCCCCAGCCTAGATAGTTTTGTTTGTTTTGTTTCGAGATGGAGTCTCGCTCAGCCTCCCAGGCTGGAGTGCAGTGGCGCGATCTCGGCTCACCACAGCCACCATCTCCCGGGTTCAGCGGTCTCCCTTCTCAGCCTCCTGAGTAGCTGGGATTACAGACACCCACCATCATGCCAGATAATTTTTTTTTTGTATTTTAGTAGACACAGGGTTTCAACACGTTGGCCAGGGTGGTCTTAAACTCCTGACCTCAGGTGATCACCCGCCTCGGCTTCCCAAAGTGCTGGGTTTGCATGAGTGAGCCACCTCGCCCAGCCCCTAGAAAGGTTTCAAGCGACAACTGTGGGATCCATGGCACCCTGGAGGTCAGGGAATGATGCTCTAGGGAATCCATAGTTGGGTAAGAGAAATCGCTCTAAGTTTGGGAGCCAGTCATTTGGATGCTGGATTTGAAGGTCACTGGAGCACCATGGAGGTCAGGCCTTACCACCTTTGCCCAGTGGGGCAGCAGCAAGCGTTGGTCATCCAGCATGAGTAGGCGGACATTGTGGTGAGAATTTACTGTTGGCGAGGGTAGGACCTAGGTCACGGGCAATGAAGTCTCGCTGATGTTCAGGGGTGAAGCCCAGGCACTGGAAGGAGTATCCACTCAACAGCCCAGCAGAAGGCTCATTCTCAGCTGTACTGCCCAGAACTGTAACTTGTGCTCAGCATAGGCATCCAGGAACCTGGCAAGAGAAAGGTCATGAATGATCCGGCCAAAGAAAGTGGACCAGACCAGCTGGGTGTGGTGGCTCACACCTGTGAATCCCAGCACTTTGGAAGCCGAGGCAGGTGGATCACTTGAGTTCAGGAGTTCGAGAACAGCCTGGCGGAACCCCGTCTCTACTAAAAATAGAAAAATCAGCCGGGCCTGGTGGCAGGCGCCTATAATCCAGCTACTTGGAAGGCTGAGGCAGGAGAATTGCTTGAACTCAGGAGGCAAGGTTACAGTGAGTGAAGATGGCGCCACAGCACTCCAGCCTGGGTGACAGAGAGACTCCTTCTCAAAAAAAAATAAAAAGAAGAAAAATAAAAAAGAAAGTGGGCCAGACCGAGAGAACAGGAAGCCTGATGGAGTGGGCAAGATTGACAGGCCCAAGGCTGAGCCCAGAAGGTAGAAAGGTGGGCTGAGGACAGGCAGATCTGGAAGTGGAACTAGGTTGAGGGTTGGGACACAGATCCAGCATGGCTAAATGGGAGAGCCAGTCCTGATCCACGTCCTTGCTGATCCCTTACTTCACAAAGTATCTGGCCCAGGTCTGGTGGTAGATGTCTCGGGCTGTCCCTTGAGTGACCCCTTCCCATTCACCGCTCCATTGGTCTTGAGCCAAGTGGGTGATGTCCAGGGCTGGCAAGGAGTGAAACGGGACGCTGGGCCAACTGCAGGGCTCAGTGAATCCATCTAGAGACAAAGGTAGTGAAGAGAAGCACCCAGAGTTGGAACACATACTAGCCCAACCAGTGCATCCGGTTCAGCCATTAGCCCCACCCTCCCGCCCCCAGGACAAAACAGCAGGGGACAAAATGTCTGTACAAGCAGACCTACCCTACAGTTTCTCCAACCCCCAGACATCAGGGCCCTCAGGGCCTGAAGCTAAGAATGCCTACCTTGAGCTTGGTATCTTCCTCTGGGAGGCTGAAGTTGTGCAACTGGAAATCATCAGGGGTGTCTGCATAGGTGTAGGTGCGGATGGAGAAGTCACAGCTGGCCATGGGTACCCGGATGATGTTATATCCGATTCCTACAGAAAGGATAGTCAAGACATGGTAGTCCGAGTCAATAGGAGAGTAGGACTCTGCTTATCTTGCCAGTCCCTAATAGTGTCTGAGTCAGGGCCAAGACTGGGCTCCTGGGTTGGAACCTGTGGAGGCTGGCACCTGGGTGGTACCCAGGCCTTTCTGAGCCTGAGTCCGTAGCAGTTAGCAGATGATAGGCAGGGAAATCTTATTTCACAGGGCATTAAAACAGGAACCAAATGTCAGGGATGGGCAGAAGTCGGGGCCCAAAGAAGGGCAAAGAAAAGTGTCAGTGGCTCACGCATGTAATCCCGGCCTTTGGGAGGCCGATGTGGGCAGATCACGAGGTCAGGAGTTCGCAATCATCCTGGCCAACATAGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAACGGGCGTGGTGGCAGGGCACCTGTAATCCCAGGTACTCGAGAGGCTGAGACAGGAGAATCACCAGACGGGAGGTGGAGGTTGCAGTGAGCTGAGATTGCTGCCCACTGCACTCCAGCCTGGGTGACAGTGCGAGACTCTGTCTCAAAAAAAAAAAAGAAAAAGAAAAGGTGTCTGCTGGGCTCGGTGGCTCACACTTATAATTCCAGCACTTCGGGAGGCCAAGGCAGGTATATCATTTGAGGTCAGGAGTTTAAGACTAGCCTGGTCAACATGATGAAACCCTGTCTCTACTAAAAATACAAAAATTGGCAGGTGGTAGTGGCGCACGCCTATAATTCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCACTATAGCCTGGAGGCAGAGGTTGCTGTGAGGGAGATCACACCACTGCACTCCTGTCTCGCCGACAGAATGGGCGGAAGAGTGAGATTCTGCCTCAAAAAAGGCTTAAAAAAGAAAAGAAAAATAGAAAAGTTTCAATGGCTCTATGTCATCTTGTCCCCTTCCTCCTCACCTTCTTCAGAAGTACGATTTAAATGCTGAAATTTTGGGCAGGGGTGACAGGGCAAGGATGTTGAGAACAGCTGCATCTGTCATGGCCCCTCCAAATCCCTTCACTTTCTGGAACTTCTGTTCTGGCTGCAGGGTCAGTATTAGCAGGCCTGAGGACATCCACAGGAATAAGATTATCGTACCCAGCGGGAAGCTCCATGGTGATCACTGACACCATTTACCTCTAGGAGGACCCAGCCTGGCCGGGGGTGAGGGAGTGTAATGGTTACCTGTGCCCGTGTGATTAGCCTGGATGGGCCTCAGCTCATCCGTCGCCCACTGCGTGTACTCTCATAGCGGCTGAAGGTACCAAGGGCAGGGAAGGTCGGGGGTCAAAGGAGTCACAGTATGTGGCATTGCAGACACACACCACCGAGCTGTAGCCGAAGCTTTTAGGGATGCAGGGGCTGGGTACCTGGGAGGAGGGAGTACAAGCAGAGTGAGGTCTGATGAAGACATGGAAATGGACACATCTGCTAGGAGAGACTGAACACGGTTTCAAAATTCCTCACCCTTGGCCGGGCGCAGGGGCTCACACCTGTAATCCTAGCACTTTAGGAGGCCGAGGTGGGCGGATCACCTGAGGTGAGGAGTTTGAGACCTGCCTTGCCAGCATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGCGTAGTGGTGGCCGCCTGTAATCCCAGCTACCTGGGAAGCTGAAGCAAGAGAATCGCTTGAATCTGGGAGGCAGAGGTTGGAATGAGCCAAATTGCACCACTGCACTCCAGCCCAGGCAACAGAGGCAAGACTCTGTTTCAAAGAAAAAAACAAAATCCTCACCCCAAAGTTGGTCCAGTCACTCAAAAGGATTTATTGAGCACCTACTAAAGTCTACCACCAGCTTACTGGAAGGCTACCAAGGACTATGAGGCAGAAGGAGGCTCTGTGCTACCTCCCCACTGCCTTGACTCACTCACCTGATACTTGGTACACTGCCTAGTAGAAGCAATCCCTGTGAGGCTGCCAGCCTTGTTACCCTACTCAAAGGCTTGGGACATTCCTGAGGACAGAATGAGGAATGACTGAAAAGCAGCCCCTCTCCACCCCTCAACTACTCTCCTGGGCAGGGCTTAGCTGCCTTTGGGTGCCCATATTTGATCTCCCATTCATTAGGACAAGGCCCACAGAAACCTGGGTGCAGCTCTCCCTGCAACCCTTCTGATGACAACTCTCTGCCACCCAAATCAAGGATTGCTCCCCACCACTCTTCCCACCCATTTCAACTAGGCCCCTCCCTCCATCTGTGCCTTGCTCAAAGAGCCATGATGGCCCTGGATTCAAAGAGAGTCTGTCATTCATTAAATTCAGTGCCAGGATTCCAGAAGCACTGTGACAATGCTGATTGGGAGCTCTCTCTCTTACCTCTCTGGAAGGACTTGAAAACTCCATCCCCTCAGGGTCATTAGATGAAGAAGACCACAGGGGTTCCAGAGTCTCTGAAGGATAGAGGATCCACTAAACAAAAACAAGGATGCAGGTACCTGCCTTGGCTAGGCACTAGGTTAGCCCCTCAAGTAATTCCGGCTTCCCCGTGTGGATGGGTCATGTGATGACTAGGAGCGTCACACGACACAGGAGTGAGAGCAACCTGTATATTTCTAAAGGGCAATTGGCTTCCTCTCATCTGTTACAGATTATATGCCCTATAAAACTCTGGAGGCATGTATGGGTGACAACTTTAGGAGCAATATCAGCACCAACAGAAAGGTTAACACAAAGACACCGACAACTT %%#$%*.+))**'82($,29985+%163;9(0=CB@==899IF4.3'/52:??AC:ACB?:;?=+A?EH?CCI<9@597A<<27?:7;():9=>88B=9<AC>EDG<BDE<>?=874@<5B?BGD>8;,,325+0&&326EFF'?9:4>;;5258:;;<-%&(%%)%%('((.33/)565533/7<??+(09@123<4528032*((1./2+1,..9?<3:::42::>8@*)8536B?<960?A?;7;343522134.45&'994-@?:A;;?=<.51779(:01?@?9<>=B@99?>93IBB=<@@B7?<9;B67:;=;76:A==HCD89?4;4-6(((;@CBCFB:7@?:AACGE2+?01/=)1=8>A:7-)2??B>>GCB>;DC@<<<A<A=>:3C98<:>?=>;><10+*5-098..B+(?=:9D24?::C?BA@BA;>=:258;B?877A66>12%+2-;%:3+999;B?C?<',5;:>@?8;761CB2<B88:>=C9/&C96-,/C8;8>993))<**(/B@ABAE:;987-51.?4700GCABAC?A230*,).,42=;@=D@=AC>796:@><00CC>?9:DF@FDA==8923<OK=8:?6$$%69.<-.9@@<877(/0.10?@=6886.0//0925@95*.11484/31:::>;:>A?C7-754ACB=?>672BCB>//(6')))'*$&%*#&()*(0-3./-4>:?@@C<=<<;7534*4-$'$$&$$%$%#%$$$+$%+)&-460606>BCG><8-.+-04-BA;?@;>>4<8=>@;9556;<=:;2.'%()','(450B@D>;39:;96417<=14517B912257$2?A<88-+/9./.-(*0*0((*+3/**1*,*%/&23?<D=-8/3+../-3011F?:03/76()4*5>>1-$%%':@94B583;;A=?FG1A=?;5201578(*(()'-;EF;/++0ABAGB7FCF=?8><?95'5;>G;?<>>A;03.))'*$(',./?06?B:>?@=@B/54@2EA?@81@EF953::>@B><@?<CAEGC>AA>DE?>?>DFC>?=AHFB:/2<-@<CB=:3.795,%&''4978;1<?8B58-.>838@>AB8:<BBB;6;=22,%'&5&+24(0?-9B9;;;;6200)'454;;</6:9@@A6:87:7:=C?3/=;97<1''()6<7AA8C?=5338,3@HBABA=C>BC@>;84BBEA@;<77:5.1/2+,AA<=>AF=-()7435/8><:5DA8578<B::=+(+40$;=976314=B:7=47<33A=-**9;:<7*8+-),49<><=-6%%)198:EB=66$6+,6AAAAB=EEJD=?9:::CA<C>42/21?:IP@A<B2+2'==@<:)+/21'*88;?>9788???:A;?>C>485:;79:;>GFFC8<=:<=01;:6@?<9<<3+4.28=4'(239,44$%%'55547<:<:<76'4:;>8(5@<=B???9BEA=>?578814938<9;<5375202-/;;9;;76;459:,)8:48466:<@7;<<;96=:;C>>6000++)..774$),,4//5=><DFD:E@CCC7@9;6;@AD?<>EDAEADBCC:@BDD=>5<>+?FIFCDKJEJEEI<6<517><<:872*%-';;>?3*+611);452<3>>@?EE:8%)>=>;;+F>=C?9:<5-0-,5)>?A6BCB>8<(%%*&>??==%.1,4%##%%%-7852<1833C=FCCB;??64844).'(-,-,-79;771,)225.4/0.+...3,,AAG2+A@?;8)%)-%(11:4050/*/+/3*.%**9A?=?:??@3;/8?+0AALKD=>C8>B44EB76466?67<<B?C:;;-@8:8@>75-*72.2%+6DDCA58<ECD4:>7;7@99A><692%$'%/3227?<44;%9::CKF?E@CD@CAA7==@K68-,+>A;4>>CEBC@/1;:6>AD?FD.(@<7<;8>@@<?-%+.-/7@:A;HD9:3,$%&&1)0@CBEDAC3:?-C/:E?AC>8DCFEF;=:-2+++84<H?@@<?:?1<@PDAE8:DDB23&1AAFCFC>BEJD<=?DH8;D>=8AF::>?E9;<@<7B<=71?:?<B<>BG<AAFH<6>A*(=2122EBDDC@SJ;95=99B@CA77>A8?8<>20<:<A0<.8+0:CBCCEAA=.%*+5<?&>;B:13-5C?GJIA@@99A(93>36=;:?CA*?'?CDD23694.0864+*.1?,97?@JHF5AA>HHKDC:0<+1?DC>?B4-++(/63617/32?<=>@B99;?9;@@E=@A<;5<6;DB@=;??/&)456-9>=3/446<;???C3;@>>>A?CAD9:CF?@?B78:A;647A=<75425)-5??DBA<<B@:<A>C>05((+'$&)%=99993+%%%.<A?C7;6>=81A-+&)'*455>7:.-31,-.9;>?@B><A?4@CDG?HC=@<<?9;B6612=B?D83%$'%+9<8C?=B;;7;:<7;GEC=?>BADCB@3-849<86008;5;6@8>;6'')$&&1159?>564/.-,*<FF>A;=044<9;>?(,4558=@CB?@::>89:67001<344/-,1/0/9,;=?<<94/.32:BL>ED?@<>=<@:;??:6(@?:1;?@BCAEEGE:4B@>22;27<7C6:A679/7A4*FD--5-184%$9::6ADB3960AEAAG=9:.-7<?@<5,9@>894/051-9<35+CJGQHF89:7@@A<=C8>?997?+<>;;<<8))0322))&2018=AA7<CCEC1:8;;878827/>:?<'(4B5CEBBEB@63A.5HFJEC5.:-1A>:311FDA<**:07>459D<>A/6/<4++,28*+36A?<H@?4@<26442595;>=31@;;?-2>DCH<BE798;253%$$-.,05AA??;>6.5?04A67:<).1*8<<;>77D<;2612=?HLB;?3/@112+.-(3-)24,...*$37@IAC=FBFH3734?G<>B<:?8/7>A?=9?B>:/-.423)?'?AAC::?@5ACA>DFGIDBECEFEA90?ADG<B@:=>JBEHAE@9:%'+*--21:=;:A>?CD9:=8?H?LGM?A9;9:<BAAF?=6;8EA?==988'#(>>=C;C=>,-2-1FG,'(<BEDD8;;?920%%*,,+4%.@A@>@@5788976@;ABC@=>HACC.0)158@>B?D3>3>BE@>A?CFIAA>EBA1/0=(&$*8=?<**78A@A>D7<><BBD8A<-//3C>>@AGGD9$8((6@AC<@>>:3;34,233-6=7:@@A<:C?@?B?ADACDD;?;=8=88@8@@;:A46967GB=BFB9=>CA>?@DJD<>>5171;@A6./8=:84555AB1313+.>8<?@:?>7A5;FGI=-)ADCC569C5.*>=5/?<<**?@:;63<=90$919%'58776+/=7I8<34,28/*+.CC>??0G?GE'E<AA<BB(8<2?56>@8879=@<7>EB@<::0(9AEBAA1<%&/6>9@C=<<D>DH@>2(23-.9<=<9=.8)''$+$'*>>A><35*%,:9)5>:9*.*/&'5<311%%051;>>360,13-'(*.7;44DCGHCB;?;=>@5AC1&>735,.86;=@??5926:<8722(0/988@57ADAAB6>II>',**4:?:755;9@CB;65<:4=;=2*:6:B7B5'>B-5??481BA;3$$;;=?AFBDLC/A+>IB@9>=<DE9*1AC??C:51&>A?;>@7;*99=?B1-3F8;358&&;09;8=7@=?NJD35:;;C7?AD?;9:>?>=5=6664?9<EA:@DE=813/)/4>2:B@:856>>F86/<;AADAIBBCCAC>=??/=?B@=0-?CBCB<>E;546456=@::6(88-///28=2B,$,AB:'+?H;F:<CCHDEH@688?6$9-=;76.CBB?>C:;3866&24;;015@>B68FC637/056%-*'&($&&#)343%-6$)3:31<C@CGC3798BD@:1<<8;A=<45B:8.'&$%47:>-)7---12:>@?9?@BJFFCCKG>3A<?==DDA=8==8>@2-<.6::;9:99<8&:)06+;+:$%1367=FAEG9B?>;((@CA>DB=?B?A<@@7$$*A82A>:./86EE/::>A8;%%;:9:(++568@A(+'&(-0/--3&&4;0??A91&(:BB=314<*;:>IJTFEAD8==IE==5FE7;5<;<5%+(+%%584+&:932@>?<>+.832+,>;;45>=;37<;<=D?FDDHJA@=EECHA>;>>@A<5@9?8C9:=:BC88<;@CD&C8675;=<A<?B=;==9/%,+2?6;>=8:G<@?69*>=>CDE=A:>>@A@C-5>=+;/B99:-/7.;7A@?;??<8;:8598;878AA976862707DH?B@=@DD58<:96731-,,+/4392.279.)(.++-04<;;=;:BCB5EEF?<=@DCEACA@@CB@UJ?C>I=FDLG=FGM@58GIFFH9?=A>A7F;GBFIIE8;:AAG<<AD5@;44(/8**)$,..7?9=29><@<;:1,:&E@=@;:7=7&:<;>DH=-7)*))',817305=7>=?F;<=@BB@GFBJCC@BCFMD87=;;74662))+069=@A9;<>FHFF@CNHPGCFED<A@BCJDA>A@@2-.106*=@@/8465A<:GF<<885?;:5677===IJBD>;1:?8D@A18.?BF7/>5@AC=BA8@28::;:;:?ACBIGD@<9BACLD?4)-&.341=GD=,02;BEEBBAHGJ>A<;C?58ESQ>9=>:A@LG@?=5;9>BG?//4/(7<A?FIGE<>/A2,<4A:=4;CEDE>7583=5BB?F<>@@BBEG?91=6$%:<879C:9>C4*/>@?CCC1593@?<7A=<2774)+)+',/263:;/55D&F<=18A=?3>;;C<?@;>860)359;,%%%+&,,89:DD;=>53AB<==;+%<<679())21/:4;<@B9;FC8><-?&<78735@BG<779;248451$('',2.<DC<<3300((,074:>>ABC;BA9;:404ED11::>:=:>A?75652>,-/50:?@;=@:<:48437FIG?>12),+0'0;>:<=113901@AG@@7A262/;+<=A=B?=?GA2,=:6)$%,78@@9H8EB=B@<+,+8;<BB>E@=<E5+2&=E;&&*'1BCCHEI@G;@?17;:>,,4++(5..7?>?9>=AB(=B@$);/-4@DGB4:@BGFC@C=940?:E?;>DBB8?:-74>?<6.(+*.6B@C@BC.-2;/7$)AF@C:B;'%542,,/39-6=>D:DB8BE?>5<7699(49@?ABHGK==:6<@A<85%',.(:<CC8CDBCD=22;(>A:2:82?@<=GC@G7:(''.''*29?@=25;>56;@2@CC@DFEF;BFCBBFBC9($?57+0<AG<89:@GIC5?:F;>C93'))CIF<1.38%%;5;;6>EC@B@887.)6%>BG;&+6;>2945F?70+3+*C576AG8?>@D57FINTRGD@<4DEC6$>:3+99<<?9FMQ6*%532-2ADIA=>BBBA<CGD<B<;0+?F?*:3=IGC@56,,:7<=?;=0?>;<<03/';<<>>4@>72)0---84438=CE>?92-:;DGE@@D=MFA:((/=?><0@648AC?>AFDA9=:9HCAD=56CCB-1DF@JHKDC((=@8B@DBFA<=>AD7;94*'398>=99<CBB++</:::;@EMCAFC>:?C86?>@A??@@GH9<F85A:<?<>74>E>AABDB:;B><8@/**'1)/@@>>;=A549647-3,67>>AA9;;<;=EC<=4AB:=@A@>C7721,*?>=6=@>8-1'**22*+9532=@?=87120%$/13-('5',;?=15=7<>KF<<AC<(2=19@00'13;13&&&*053A2.FAADE-,=;@5?=>D312+)@CF=C>C<2?B>ADHCDBA9:@=?GD?@D66DCE@LKB?44B&+@8@>;CDBDB=BCCDC7:944<?C<DCD=6:9:E<GGH)83DA=>@AEHCDFAK=<2)>>:,5//&./10548<EC5;<5<4??<=A%=89CIPG???BA49:;7BBBCB4:5E?@:1-+.'*:9%%(15<<IGIFBCCBA><::<68>B?EJG;;.7.)+,)0298:51C<==?A?<;8:569=;6;B?=9:64@B@;8:=:,938ABECAC;>6:17CC@BCC+-;8<5334:7?@A@=A39A;A>A?:=DB;<C<6;?@A45577:?==D>>--:7..09:((4<??4:DCD37@GD00:?,?7GBEA?B;5:/2)%&'$$&0,&33482*'<=@=B<:<DBA,9),/AA;=4521*;2<600.84+2;-:<.'0>CCA@IB>;?32BCBA:;<6<E>=7;@<;;:9770$%%+/3333A7>BA@@><:9====>@56D::=@???G>24'<<GFA;><?A,'/2-5>=;@@?9@BI=:7;988:2203418<;>6@D8C>@G>?9==<?@FFBD;@9897414&+:2422IN;.5,,>>6@@/(B9F@@=61<?>EAD@E=;9BBE?A<8:44@9&*$)$*+366;34,(&&?:9GGJ93>E<@A?I>I<==6>92:G9=199:>C>890>>AA;22:)+)(.-.=?CGCD:44*=:./-CGFC6//2)--1,$&##%'++;=?-2>00(%*((,*..066<><@D>6?><?B<>D?(0<;8:EFIG@E?IJDA60=4(.''7@,7H<?@=8==HCFHAEAB1;-639C<;4271$*....57>CF=>C>F?<>2--0E>=:=<<D6442.;6:BAD<=76DEB=9<7872,''?54.*->54,),9??CD65<:;B619'.>4:??@61'&&&#<AB@B?DL;8?;<7:7=C>AE9:=>BE@=@D??@@<=8@ABB>9<5A;G>;=POHDHHC@@:=?4<AA;A=>@?>?<9>:A=DC?<A?C:ICG<;CGOH?6?7>=0/&&+-*/'09:DGAHEC<?9@>A=<?8/FA?@?G9@;DDC97CBAB>9I;?9=8GF74ABB@4=;=>;27CIAEC>'+>GEC%8-6DEHABFDAB@:<83219=CC9DCFBFDEC5?:<DB=A<<;=<;C<@@2DF;>=8','*/65=A>?>C:B=<43;:BC51*'$##&*@AE52*(GKJ,0E>NB;9''7;978:=;@EA41+$/99@;<89766<=644>B''9139:>=<AACGD?J::727-7==><>JG;*,)''''()-9BFMG=5:;51D8>?>6/1,:AB78/<DD4D971&,%-.025)*'*+3405)$$(005460>99922;1124:;@AD<;(-G/(;A>9BCCEFDH=A@10A37/61%+#.339;:=BA==?9:=EB:89B<83B7;<;;<=647//)'&%/0/328%);5,+50<?>E02AA=4DB;8<5113536761&,(1,4365C<8-355C:HB68/5'1:9=7;61--%=?>:89;+,269A?GDDDED@<DA=>401;(??F?AFD<=3.&$#%#%&,9;=B9=BFGGB<?B=97>8>==B.AE@CBGFBFE?A?<@=B=7<;;==B93?:;9<'>7>16=@5,1?=/.?=;:88748CE=??BCC??B,--+.---260CA<@=?>KBBB1174.$$$(<?@D<@=<AA9205///.*$'*,%/::;@CGB<;>?04+4:8=7>>AD=<C>@51A9A<>?2/00-7443*+,.<=;=><05***(59:/01852;2;=@@EDH@=@F@C=??7B<;*+>C;<944:DCFF35?8>=9A@AB@2@?<;?,A@>><)),>$<BB>AEB:H58BC921*,-?BC>=>B@:?;8955/2483/6815@BABBACAA=A@?EIAABJED?>34.14@AD?A@A>?DD?D@@HRHDCACB?=@@@FJNJGHCHC??4:4@EGC;AAA;D5CEDDE=CJFCG@A>??>=AE23)%)+0.;?CBB<G@G;<76HE>@>6?87=>@?B;>;/;@CB=B<?C@;>37..005AD?@?305;B;:D5;6/%+(&+/4-.--)$%)$$&&366<@9@@99CC3&DE@I???;ABAB89@C;DC;87BC428;663)179>CCA*8??2*2.)*03:8<844;64$18;=@B:1:?>:99B??LD;EB:=?13//@>>4-+>--,:;78>DE1=>:22:7A?3F=E@H@:=;8&#&&)<@GD99@;587;>8?/0&'09@EDC=:,&&&&#%%*())?<;-'*3230422?=@:;>4@8+5)+;97.79:3-*''-+/;BDD?;>E?=<&.844438BCAB9:897:6D<E3H1DJ<708:3.==<<<=@=99466;<;<34/99<@E@EC2@2-)1))9;>9=C@A::1D;?=I6-DCE<<;>=>$%865,5?:::@;36(26;?B+=DFE>>?=CEGBBC;=KFG6C98:;6888<@?><9=4?:=;@C4:.5))/::;=@61.44+9-.;9$:?53.3%,,037;26)).)&(.4011028:@A9@@EC>6::-,,,2=>7<122.$%'./22389=A>C;77-$(///9CAF<;IJ0*2H@??CA;@9<=;AA=?>=4967)8///6:@=@=@?BBCBBA),87,=:?9,10.184@A&0@=IJADE@CBFHI77A>/-)>=B=6:=<:>A<8?9+=-A>@?@D>CDC@B9<6049>40.71@@@:<>8:=@@:<7>@D=BDFGBCF?><822)>:;@=0;07?<=<%(6**1/;+<6::4;;>3)*-:3C5565;<>?)6FGFI;:9;FI?A>83A=;579?9=C<8+302:FDFA=:64=92(8:97%$('(.=;>=97;:><=?=<:-%>9BEFDC=@C=>:=7=<;CBE@?A>;.<883?C?CB>IDIBAA21':99))'1)*))%$$4DDHAD?/DE/*@GC<=1DAGCGE:;A)<?BBFC?G<33938=;6<A77<8==HDBFJHBBBD>>'23,5:@98<023-).5&&%/19=8;;9DBDG.)AC=@@?=;;:=8@3&198:<@>;>>?/6FEF<=?E><=+++ AS:i:201 NM:i:29 XI:f:0.8564 XS:i:0 XE:i:201 XR:i:193 MD:Z:15T3^G10T8A0A28A6T8^A10^TT0G1A1^C0C29^G1^G15C0A10T8G1A4^T1^A0A14 SV:i:2 SA:Z:1,155202292,+,86S49M3I18M1D11M1I1M2D26M1D14M1D42M1D9M1I17M1I95M1D99M1D24M2D8M1I4M1D99M1I2M1D55M1D65M2D4M1D2M1I8M1D5M1D32M1D17M1I4M2D33M1I15M1D6M2D4M1I2M1I80M2I36M1D7M2I108M2D16M5D20M1I2M1D4M1D42M1I45M1D25M1I2M2I55M1I118M2I20M2I48M1D116M1D14M1D67M1I29M1D108M1I4M1D8M1D29M1D3M1D1M1D29M1D62M1I33M1D38M1D137M1D27M1D35M1I80M1D6M1D32M2D5M1I26M1D22M1I82M1D35M1D31M1I34M1D50M2D70M1I153M1I15M1I14M1D33M1D139M1I2M2I31M1D5M1D65M1D25M1D6M1D95M1I61M1I25M1I1M1D9M1D13M1D83M1D65M2D34M1D31M1I20M1I38M1D3M1D135M1D40M2I12M1D8M1D19M1I4M1I13M2I87M1D13M1D102M2D25M2I8M3D240M1D97M1D1M1D53M1D8M2D186M1D2M1D15M1I17M1I74M1D78M4I124M1D77M1I44M1I14M1D114M1D50M1D55M4D18M1I15M1I72M2D1M1D75M1I19M1I12M1D62M1D70M1D54M5D27M2D59M1D73M1I32M3D5M1I151M1D7M1I1M1D34M2D14M2D8M1I24M1D2M2D1M1D2M1D41M1I3M1D48M1I2M1D70M1D44M1D90M1D8M1D15M1I50M1D1M2D32M1I2M1I42M1D13M1D10M1I140M1D72M1D20M1D44M3I29M1D17M1I57M2D14M1D2M2I12M1D99M3I21M1D14M1D64M1D2M1D11M1I41M2D2M4D6M1D61M1D81M1I12M1D41M1D49M1D236M1D35M1I15M1D4M1D32M1D33M1D32M1D18M1D46M1D6M1I9M2D12M1I18M1D1M2D2M1D59M1D61M1D13M2D70M1D3M1D8M1I4M1I32M1D81M1D108M2D68M251S,36,395; QS:i:8705 QE:i:8898 CV:f:2.157389

wdecoster commented 4 years ago

That looks like a very long read. Perhaps you should try alignment with the --bam-fix argument

Laurenz0908 commented 3 years ago

I have the same problem using ngmlr version 0.2.7 and samtools 1.11 --bam-fix did not help It occurs on several lines, one is for example: m54067_210510_000837/8520076/128701_137638 2064 tig00002724 1759331 -2147483648 8840S51M1D61M * 0 0AAAAAGCCGCATACACAGCTTTACGCCGTCAGGTTCCTTCCATTGATATGCAAATTCCAGAAGATATCGCGGTAGAGCAATCCGATTTCTTGCAGGCTCTCAAGGAATCAAACCAGCAGTACTTCGTAGCATGGAAGTTGAAGTTCCTCATGTGGAATGGGAAGATATTGGCGGCTTGGAAACCATAAAACAGACCTTGCGGGAATCTGTGGAAGGGGCTTTGTTGTACCCAGAACTCTACAAAGAAACAAAAGCCAGAGCGCCCAAAGGAATACTGTTGTGGGGTCCTCCCGGTACTGGTAAAACATTATTAGCAAGGCTGTGGCTTCTCAAGCCCGAGCTAACTTTATTGGTATTAATGGACCAGATTTACTCAGCCGTTGGGTGGGAGCCAGTGAACAGGCGGTAAGAGAATTATTTGCTAAAGCCCGACAAGCAGATCCCTGTGTAATATTTATCGATGAACTGGATACATTAGCTCCAGCACGGGGAACTTACACTGGTGATTCTGGGGTAAGTAACAGGGTAGTGGGGCAATTACTGACGGAGTTAGATGGTTTAGAATCGGGGTCTAACATTTTGGTAATTGGGGCTACCAATCGTCCTGATGCCATAGACCCTGCTTTGTTACGTGCAGGACGGTTAGATTTACAATTGAAGGTAGATTTACCAAATTTAGATAGTCGGTTCAAGATTTTACAAGTTTACAATCAGGGCAGACCTCTGTTAAATGTGGACTTGGAACATTGGGCTAAGATAACAGAAGGTTGGAATGGTGCAGATTTAGTATTACTCTGCAATCAAGCCGCTGTAGGAGCAATTCGCCGTTTTAGGTCTCAAGGTGAAACAGATACTGCTGCGATCAAGATTACTGTTGATGATTTCCAAGCTTCTTATGAAGCTTTAAGCCAGCAACGTACAGTTTAGTTCAACCTACGCTTTGGGGCAATTTATAAATTGCCCTAATCCTCTGCTTAAAGAACTGCACAAGCGGCGGGTCGCCCGTACGCCAGAACTCGCTAAGCGCTAAATTGGACGCAATGAGGACAGACCACTAAAACCTGGTACGCAGTAATGTTTCCGTCCCCTGACTGGGAATTGGTAGGGTTGGACTCTATAGGTTTGAATATTTTATCGAAAATTTTGGGTTTAAAACCCCGTCGTTCTACGACGGCTTTTCTTGGTTTTTAATGTAAGACTTCAGAACCTCTAATGGTGCGCCACCGATTGAAGATACAAAATAACTAGGCGACCATAAAGATTCTTTTCCGTAGGGTTTAGGGAATCTGGCTTGCCCATATCTACGACTGTCAACTCCTTTTAAAGCATTTACTATGACAGAAATAGAGAGTTGAGGAGGGTATTCAACTAAGGTATGAATATGGTCTGACTCCCCATTAAATTCCAGCACTTGAAAATTCATTTTTTCAGCCACTTCTCGAAAATACTTCAGGATTAACATCAGGCTCTCCACTGTCAAGATTTGGCGACGATACTTAGTCACACAGACCAAATGAATCTTTAAATCCGAAACAGACGCTCTTTTCGTCAAAGGGTTGACATCTACCAGACTCCTAGCTATAATTAATGCAGACTCATACATTTTTACAGTATGAAGCTAAGATATCGATATCGAATTTACCCAACAGACCAACAATAGAGATTGATGTCTCAATTGTTCGGATGCTGTCGGGTAATATTTAACGATGCCTTAGCATATTGTCAAGAACAATACCGTGCGGGTAACAAAAAGCCTAATATCAACGAACTTTCTCAAAGACTTACAGAACTCAAAAAAACCGAGGAAAAAATCTGGTTGGGAGAAGTTTCTTCTATTCCCTTACAACAGTCTTTAAGAGATTTAGAGCAGGCTTACTCTAACTTTTTCAAATCCTGTAAAGGACAAAGAAAAGGTAAGAAAGTTAAACCTCCCAAGTTTAAAAAGCGTAAATCTAAGCAATCCGCTAGATTTATGGATAATGGTTTTAAACACCTTAATTCCGATTACATTTACATCGCTAAAATTGGTGATATTAAAGTAGTATGGAGTAGAGAGTTACCCACAAAACCTTCTAGTGCTACCTTAATCAAAAACAGTGCTGATAGGTATTTTGTCAGTTTTGTTGTTGAGTTTAATCCTCAACCCTTACCGGAAAACCAAAATTCTGTAGGAATTGATTTAGGAATCACTGATTTTGCCACATTAAGCAATGGTGATAAACTTAAATGTCCTAAGCCTTTAAAGAAACAACTAAAGCGTTTAAGAAGATGACAACGTAATTGGTCGAGGAAACAGAAAGGCAGTCAAAGAAGGGAAGTTGCTAGAAAGAAATTAGCTAAACTTCACGCCAAGATTTCTGATACCAGAAACGATTTTCTTCAGAAATTGTCAACTCGGATTATTCGTGAAAACCAAACGATAGTCTTGGAAGATTTAAAGGTTTCAGGGATGATGAAAAACCGAAAACTATCCCGTGCTATTTCGGATTTGGGTTGGCGCAGTTTGAGAACAATGCTAGAAGTTAAATCTGTTATGTACGGGCGTGATTTTCGTGTCATTGATAGATGGATTCCCACTTCTCAAACCGGTTCTGGCTGCGGTTTTCGCGGTGGCAAAAAAGAATTAAATATGAGATAGTGGACTTGTTTAAATTGTGGCACATCTCATGATAGAGATGTTAACGCTGCGATTAATATTAAGGTCGCCGGAGGGCATTTGGAGACTTTAAACGGATGTGGAGAGAGGGTCAGACTTTCTGTAAAGAAAGCACATCTCAATGAAGCGTCAACCCGTCCAGCATTTCAACAATTGTCAATCTTTGATTTGTTGAAATAGATGGAATCCCCGTTAGCGCACGCCGGGACGGATAGGCGGATGTCAATACAGTCCTCTGTTTCCTAAAGTGCAACTATTTGCAGTTGGGAAGGAAGGCGACAGGCTCACACAGAGGAAACAGCATATGCAGGGCTGACGATCGCTTTAGTGGTGGTAGGTGTCGCGTTTGAGCGAGTCGTAGTTTGGGGTGGGTTGGCTGTGGGAAACGCAGTTGTCTGGTTTTGGATACTTAGGCGATCGCATCCCCGTTTAGTTAGTCGTCTTTTAGAGGCTATTGCCCGGTGGGAAGTGCGGTTGTCTGGTTTTCTAATCTTCGAGAACCCCAAGGCAAGCTTAGTAGGGGCGGGTGCGCGATCAGCCACGGGAAAAACCAGCCAACTATATACACCCGCCCGCGGGTGCGCGATCAGCCATCGGAAACACCAGCACCACTCGCACACAGACCATGCCTTGGCAGCGAAACGCCCAGAGACCCAGCGAACAGAGGAAAAACCAGCCAACCCCCTCAGTACAGGACAAAAAAGCTGAAAGTCTTAGCCAAAAGGGAATCTAGCCGAAAATTCTGAAGGGGGCGCTCCGGGCTGAGGTGAAAGTCGAGGGGTGGGGGGTAGCGCCAAGTTGGGGTGAGAGGGTTAGATCGGAATTACCACAAACCAAACTTCCCTCTCATGAACGAGCTTAACCGATTACGAGACACTTTGCGCCCTCACTTGCCCTGGCACGGGGCGAGATTAAACTTTGTCTGCCTGTTCCTGATGGCGCTATTCCAAACAAAGACGGTTAATCTGATGGAAATAGCGACTGTATTCGCAAATCCTGTGCAAATTTCCTCAAATTACCAGCGATTACAACGTTTTTTTCGGCAATTCAAATTTGACCGGGCAGAGATTGCCCGTTTCGTCGTTAGCCTCATTGACATTCCCCAACCTTGGACTCTTAGTCTCGACCGCACCTGTTGGTCTTTCGGTCAAACCCATTTCAACATCTTGATGTTGGCAGTCGTCCACGAGGGGATTGCCTTTCCCCTGCTGTGGAAGATGCTTGACAAAAAGGGCAATAGCAACAGTGGCGAACGCATGGACTTATTCGACCGCTTCGAGGCACTATTTCCTGACGTGGAGGTGGCTTGTCTGACCGCTGACCGGGAATTTGTGGGGCGAGATTGGCTCTCGTATCTTCTCATCGACCCCCAGGTTCCTTTCCGCCTACGCATCCGCCACAGCGAGCTGATTAGTCCTAAGTTAGGAGGAACTCGGCGTAGCGGCGAACGAATGTTTGATTCTCTGCGACCCGGAGAATTTCGCCAGCTTTCGGGTCGCCGTTGGGTTTGGGGACGGCAGGTTTACGTCATTGGCTCTCGTCTGGCTGATTCGGGGAGTTGTTGATTCTCATCACTAACGCTTGCCCCGAAACGGCCCTCCCCGACTATGCTCGGCGTTGGGGTATTGAAAACCTCTTCGGAGCCTTGAAGACTCGGGGCTTCTGTCTCGAATCGACTCACTTTAAGGACCCTGAGCGCTTGAGCCGTTTATTGGCTTTGCTTAGCCTGGCTTTTATAGCACAAGACAGAACACTTAGGACGTATAATGGAGAGGACGAGATGACTAAGAAGACCAAAAATGCCAGCCCCCTATAGTTACGACCTCAGACAAAAAGTTATTGATGCCATTGAACTAGACGGTATGCCCAAAACAGAAGCCAGTCAAGTTTTCCATGTCAGCCGGAACACCATTAATCTCTGGCTGCAAAGAAAAGCACAGACCGGAGACTTCCTCCCTAAACCTCATCACCGACCTGGCAATAACCACAAAATTACCGACTGGCAAAAATTCAAGGCTTTTGCCCAAGAGCATGGCGACAAAACAGCAGCTCAAATGGCTGAACTTTGGGATGACGACATCTCTCCTCGCACCATATCCAGAGCCTTGAAGAAAATTGGCTTCACCAGAAAAAAAAACTTACGGCTACCAAGAACGTGATGAGCAACAGCGAGAGGAGTTTATGGCTCAGATTGAACAGATGGAGCCACAAGAAGTGGTCTACCTCGATGAAGCCGGCATGAATAGTCAGGACTCGGATTACCCTTATGGTTACTGCGAGGAAGGAAAACGCTTCCATGCACTCAAATCAGGGAAGAGGCAGGGCAGGGTAAGTATGATAGCCGCATGGTGTCATCAACAACTCTTAGCTCCCTTTAGCTTTGAGGGTTGTTGTAATCGGACAGTGTTTGAGTTGTGGTTGGAGTTCATCTTAATTCCAACATTGAAGCCAGGTCAGACTCTAGTATATTGGACAATGCAACGTTTCATAAAGGGGGGCGGATTGCTGAACTGGTGGAGGCAGCTCAATGCCGTTTACTCTATCTTCCGCCTTATTCGCCAGACCTCAACAAGATAGAGAAATGTTGGTCGTGGCTGAAAGCCCGTATTCGCAACTGCACGTGAGCAGTTTGATTCTCTCCATGATGCCATGGATTCCGTTCTCAAAGCTGCGTCCTAACCACCTTGACTAATGCTATACTTGGGCTATGAAGGTGGGTTTGTGGATTCACCAAGGTTCACCCATTCCTTGGAAGGCTCACGGACGACGCTCCCAGAGTCTTTTCCGCACTGGCTTCGATTTTCTACGCCGCACTTTCTCTAATCTGCCTTTGTTTTCAGGGCGGTTTCACCAGGCTCTACAACTTTTGTCCTGTACTTAGGCCAACCCCATACACCAATTATTAATTATTAATTATTAATTATTAATTATTAATTCCAGCCAACTATATACACTCGCCCCCAGGGTGCGCGATCAGCCATCGGAAACACCAGCACCACTCGCACACAGACCATGCCTTGGCAGCGAAACGCCCAGAGACCCAGCGAACAGAGGAAAAACCAGCCAACCCCATACACCAATTATTAATTATTAATTATTAATTATTAATTATTAATTCCAGCCAACTATATACACCCGCCCGCGGGTGCGCGATCAGCCATCGGAAACACCAGCACCACTCGCACACAGACCATGCCTTGGCAGCGAAACGCCCAGAGACCCAGCGAACAGAGGAAAAACCAGCCAACCCCATACACCAATTACAGCCTTTACACAAAACACAAAGTACGGTATAATGTATGGCTATTGTATAAGTTATCACAACCAACTACCATGAGCCGTTACTCTTTAGATTTTCGGAAAAAGATAGTAGAAGCCTATGAAAAAGGAGACACTTCTATCCGAAAAGTAGCGAAGCGCTTTTTGGTGAGTCCAGACACGGTAAGGCGACTGGTCAAACAGTATCGATTAACGGGAGACTTATCTCCTCGTAAGTGTGGCACTAAAAAGAAAAGCATTTGATCTCAGCATGAGGAAGCCGTGATAGATATTGTAGAAGCCCACCCCGACCTGACCTTATGGCAATATAGTGATAAGCTCAGAGACAAGCTGGGCATAAATGTCAGTACGACCATGATAGATAGATTTTTAAAGCAGCACGATATAAGTCTCAAAAAAAACATACAGGAGCGAAAAAGTAGTAACTGAAGAAGTACAGAAAGCGCGAGTAGATTATTGGTAAAGAATTAGAGATGTAGCTCCTGAAAAGCTTGTGTTTATTGATGACAGTGGGTTATGGGTAGGGATGAGCAGACCGTTAGCCAGAGCGACGAAAGGGAAAAAAGTCTATGAACTGCGGAAATCCTATCGAGGTCAAAAAATGACAATAATTGGTGCAATTAAGCTCTCCGGAGTGGTAGCGACTCAAACATTAGAAGGGTCCATGAAAAAGAGGATTTTCTCCAATTCATCAAGTTAGATTTATTGCCAAAATTAAAAAAGGAGATGTAGTGGTAATGGATAACTGAAATTCTCATCATCGAGAAGAAGTCAAAGAAATGATAGAGTCAGTAGGAGCCAGGGTAGAATATCTGCCGGTGTATTCCCTGAGTTTAACCCGATAGAGATGATGTGGTCACAGCTCAAAAGTTTAGTGTGCAAGTTCAGGACGGAGACCATGGAATTATTAGTGAGATTGGTAGAAGTGGCGGTAAGCCTTGTGGATTTACAGTGTTTGAATAACTGGTTTACCAAGTGTTGTTGCTGTGCTTAATGATTGAGATAAAGCCTGTATTAATTATTAATTATTAATTATTAATTATTAATTATTAATTATTAATTCCAGCCAACTATATACACTCGCCCCCAGGGTGCGCGATCAGCCATCGGAAACACCAGCACCACTCGCACACAGACCATGCCTTGGCAGCGAAACGCCCAGAGACCCAGCGAACAGAGGAAAAACCAGCCAACCCCATACACCAATTATTAATTATTAATTATTAATTATTAATTATTAATTCCAGCCAACTATATACACCCGCCCGCGGGTGCGCGATCAGCCATCGGAAACACCAGCACCACTCGCACACAGACCATGCATTGGCAGCGAAACGCCCAGAGACCCAGCGAACAGAGGAAAAACCAGCCAACCCCATACACCAATTATTAATTATTAATTATTAATTATTAATTCCAGCCAACTATATACACCCGCCCCATAGGACCCAAACACCAGCACCACTCGCACACAGCTCTGCCCTTTGCACGGTACGGCTACAAGACTGGTAGAAGAGGGGCGGGTGGTAGTAGGGGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCGCAGGTAGGCTTTGAAAACCCGCCCTTGCGGGTTCCGCAGTCAGCCATAATTCCCACAAATCAACTTTAAAAACCCACCGACGACGATGCGCCTAAGACCATGCACCCCACAAACAGCACAGAACACGCGCCAC * AS:i:219 NM:i:1 XI:f:0.9912 XS:i:0 XE:i:219 XR:i:112 MD:Z:51^C61 SV:i:2 SA:Z:tig00002724,1750485,-,105M1D209M1D2741M1D383M1D770M1D915M2I1477M1D49M1D109M1D2080M112S,22,10; QS:i:8840 QE:i:8952 CV:f:1.251117

fritzsedlazeck commented 3 years ago

@Laurenz0908 just to check is this HiFi data ?

Laurenz0908 commented 3 years ago

@fritzsedlazeck yess, and as reported in #83 the read has a negative MAPQ value so I think it's the same problem

fritzsedlazeck commented 3 years ago

Mhm I need to check if that has todo with some samtools update.. thanks for reporting. Fritz

pi3rrr3 commented 3 years ago

Hi, if useful, I face the same issue using ngmlr 0.2.7 and samtools 1.10, with standard PacBio data and two distinct libraries. Faulty lines also show negative MAPQ values: m54214_200325_095631/4260092/119840_123419 2064 Scaffold04 8785531 -2147483648 ... m54217_200324_032638/4260231/60987_76267 2064 Scaffold14 4429481 -2147483648 ...

As mentioned earlier by @marcotoffoli, using samtools 1.9 solves the issue.

Euphrasiologist commented 3 years ago

Can verify also that downgrading samtools also worked for me...

splaisan commented 2 years ago

Hi! I have today the same error with or without --bam-fix, --rg-id and --rg-sm all set. Is there an update and fix to have ngmlr work with the current samtools?

ngmlr 0.2.7 samtools 1.15

skr3178 commented 2 years ago

I face similar issue when running this tutorial https://github.com/fenderglass/jax-meta-tutorial minimap2 -ax map-hifi assembly_graph.fasta SRR13128014_1gb.fastq.gz -t 30 | samtools sort > graph_alignment.bam samtools index graph_alignment.bam

splaisan commented 2 years ago

I face similar issue when running this tutorial https://github.com/fenderglass/jax-meta-tutorial minimap2 -ax map-hifi assembly_graph.fasta SRR13128014_1gb.fastq.gz -t 30 | samtools sort > graph_alignment.bam samtools index graph_alignment.bam

Hi @skr3178, this is a recent recurring issue with samtools installed from bioconda and conflicting with other libraries (ncurses, openssl, ...). A fix is to use a non-conda copy of samtools installed on your server (works here for me)

skr3178 commented 2 years ago

Thank you @splaisan . I used a different reference file and it seemed to work. I didn't try the solution you posted but that may also work.