mahulchak / quickmerge

A simple and fast metassembler and assembly gap filler designed for long molecule based assemblies.
GNU General Public License v3.0
192 stars 31 forks source link

merged.fasta the same as hybrid_assembly.fasta #25

Closed peterdfields closed 6 years ago

peterdfields commented 6 years ago

Hi,

I'm trying to run quickmerge with an assembly generated with dbg2olc and canu. I run both the wrapper script and individual steps as described on the readme. I don't receive any errors. I do notice that the resultant merged.fasta has the same content exactly as the hybrid_assembly.fasta. I can also see that while the summaryOut.txt and aln_summary.tsv seem normal anchor_summary.txt file is empty except for a header line. Please let me know if any additional info would be useful in determine why the merging doesn't seem to be working.

esolares commented 6 years ago

Hi,

Thank you for your email. Could you please share your entire run options/parameters/script for running nucmer and quickmerge.

Thank you

On Nov 26, 2017 8:47 AM, "peterdfields" notifications@github.com wrote:

Hi,

I'm trying to run quickmerge with an assembly generated with dbg2olc and canu. I run both the wrapper script and individual steps as described on the readme. I don't receive any errors. I do notice that the resultant merged.fasta has the same content exactly as the hybrid_assembly.fasta. I can also see that while the summaryOut.txt and aln_summary.tsv seem normal anchor_summary.txt file is empty except for a header line. Please let me know if any additional info would be useful in determine why the merging doesn't seem to be working.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/mahulchak/quickmerge/issues/25, or mute the thread https://github.com/notifications/unsubscribe-auth/AEI6vn62vrEAM7sUJTzjVHgwryA8LiYMks5s6ZXkgaJpZM4Qq1lS .

peterdfields commented 6 years ago

Sure. First I sourced the .quickmergerc, then:

for the wrapper: merge_wrapper.py hybrid_assembly.fasta self_assembly.fasta

which finished with no errors.

Then with individual steps as:

nucmer -l 100 -prefix out self_assembly.fasta hybrid_assembly.fasta
delta-filter -i 95 -r -q out.delta > out.rq.delta
quickmerge -d out.rq.delta -q hybrid_assembly.fasta -r self_assembly.fasta -hco 5.0 -c 1.5 -l 400000 -ml 5000

which also didn't show any errors. Both resulted in the same output content described above.

mahulchak commented 6 years ago

Can you see the alignment in the out.rq.delta filea with mummerplot ? You can do mummerplot -fat -filter out.rq.delta

On Sun, Nov 26, 2017, 10:14 peterdfields notifications@github.com wrote:

Sure. First I sourced the .quickmergerc, then:

for the wrapper: merge_wrapper.py hybrid_assembly.fasta self_assembly.fasta

which finished with no errors.

Then with individual steps as:

nucmer -l 100 -prefix out self_assembly.fasta hybrid_assembly.fasta delta-filter -i 95 -r -q out.delta > out.rq.delta quickmerge -d out.rq.delta -q hybrid_assembly.fasta -r self_assembly.fasta -hco 5.0 -c 1.5 -l 400000 -ml 5000

which also didn't show any errors. Both resulted in the same output content described above.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/mahulchak/quickmerge/issues/25#issuecomment-347026878, or mute the thread https://github.com/notifications/unsubscribe-auth/AHMD6KYhvULQ00r_3yYkUAlDSLJ_7abKks5s6agygaJpZM4Qq1lS .

peterdfields commented 6 years ago

Yes. Would it be useful to upload this file?

mahulchak commented 6 years ago

Sure. If you can share the delta file, that would be great. Is 400kb n50 of your self-assembly file?

On Sun, Nov 26, 2017, 11:14 peterdfields notifications@github.com wrote:

Yes. Would it be useful to upload this file?

— You are receiving this because you commented.

Reply to this email directly, view it on GitHub https://github.com/mahulchak/quickmerge/issues/25#issuecomment-347031030, or mute the thread https://github.com/notifications/unsubscribe-auth/AHMD6LTzP2Mh-0wjWAHC7FDU2_hhqE2Rks5s6bhGgaJpZM4Qq1lS .

peterdfields commented 6 years ago

You can download the file using the following link: https://www.dropbox.com/s/5qlld3k2m0n4da4/out.rq.delta.gz

And the 400kb n50 is from the self-assembly file.

mahulchak commented 6 years ago

Hi, I checked your delta file. There are no overlap that shows >5 HCO value, and therefore no merging. Are you sure the assemblies are from the same strain/species? Mahul

On Sun, Nov 26, 2017 at 10:37 PM peterdfields notifications@github.com wrote:

You can download the file using the following link: https://www.dropbox.com/s/5qlld3k2m0n4da4/out.rq.delta.gz

And the 400kb n50 is from the self-assembly file.

— You are receiving this because you commented.

Reply to this email directly, view it on GitHub https://github.com/mahulchak/quickmerge/issues/25#issuecomment-347044424, or mute the thread https://github.com/notifications/unsubscribe-auth/AHMD6BDmby2kt_j-GrKamRy1AjJKtQBvks5s6egHgaJpZM4Qq1lS .

peterdfields commented 6 years ago

The reads almost certainly derive from the same species. One point that could be important is that the self_assembly.fasta is almost 50% larger than the expected genome size, which is probably the result of redundancy in the assembly. Do you think this redundancy could result in the type of whole genome alignment results being observed here?

mahulchak commented 6 years ago

If there is high homology between the two assemblies then it should show HCO >5. Will you be able to share the fasta files so that we can investigate this a little more. We will use your files only to understand the problem and will not share it with anyone else. Let us know.

peterdfields commented 6 years ago

Do you have a preferred email address where the files could be sent to?

mahulchak commented 6 years ago

Please send them to mchakrab[at]uci.edu

On Mon, Nov 27, 2017, 15:14 peterdfields notifications@github.com wrote:

Do you have a preferred email address where the files could be sent to?

— You are receiving this because you commented.

Reply to this email directly, view it on GitHub https://github.com/mahulchak/quickmerge/issues/25#issuecomment-347359287, or mute the thread https://github.com/notifications/unsubscribe-auth/AHMD6NByXXsWc7f8NZ7s0ID1B2gMnn77ks5s60CNgaJpZM4Qq1lS .