ndierckx / NOVOPlasty

NOVOPlasty - The organelle assembler and heteroplasmy caller

AN INCORRECT FILE FORMAT! #201

Open wangwenzheng-agis opened 1 year ago

wangwenzheng-agis commented 1 year ago

Can someone help me? I am using NOVOPlasty to assemble a chloroplast genome, but it gives me this error. My input reads were mapped to the whole genome and then extracted from the BAM file with samtools fastq. Is that the reason?

ndierckx commented 1 year ago

Hi,

It doesn't recognize the read IDs. Have you tried simply using the complete dataset? That is almost always the best way to assemble. I would advise against using reads that were extracted from an alignment.

wangwenzheng-agis commented 1 year ago

Hi, I solved the problem by using the complete sequence data. However, the result now contains the invalid bases 'R' and 'S'. Can you tell me what they mean, and should I delete them or handle them some other way? Best wishes, William Wang

ndierckx commented 1 year ago

These are ambiguous nucleotide codes (R means A or G, S means C or G). If some software doesn't recognize them, just replace them with an N.
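
For anyone who wants to script the replacement suggested above, here is a minimal Python sketch. It is not part of NOVOPlasty; the function names `mask_ambiguous` and `mask_fasta` are made up for illustration, and it assumes the assembly is a plain FASTA file where header lines start with `>`. It maps every IUPAC ambiguity code other than N to N and leaves headers untouched.

```python
# IUPAC ambiguity codes other than N (e.g. R = A/G, S = C/G).
AMBIGUOUS = "RYSWKMBDHV"

# Translation table: upper-case codes -> "N", lower-case codes -> "n".
_TABLE = str.maketrans(
    AMBIGUOUS + AMBIGUOUS.lower(),
    "N" * len(AMBIGUOUS) + "n" * len(AMBIGUOUS),
)


def mask_ambiguous(sequence: str) -> str:
    """Replace IUPAC ambiguity codes in a sequence string with N."""
    return sequence.translate(_TABLE)


def mask_fasta(lines):
    """Yield FASTA lines with sequence lines masked and headers unchanged."""
    for line in lines:
        if line.startswith(">"):
            yield line  # header line: keep as-is
        else:
            yield mask_ambiguous(line)


if __name__ == "__main__":
    # Hypothetical in-memory example; a real run would read from a file.
    example = [">contig_1\n", "ACGTRSACGT\n"]
    print("".join(mask_fasta(example)), end="")
```

To process a file, the same generator could be fed an open file handle and its output written to a new file, so the original assembly is kept in case the ambiguous positions are needed later (they may reflect heteroplasmy or uncertain base calls).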