Closed RacheliHadjez closed 4 months ago
Hi Rachel, Your command lines look correct. Unfortunately, I cannot reproduce your issue using your command lines on small sample data (available in the repo). Does this produce output for you as well?
minimap2 -ax map-pb -t 20 --cs=long test/data/sample_layout.fasta.gz test/data/sample_reads.fastq.gz > aln.sam
racon -m 8 -x -6 -g -8 -w 500 -t 14 -q 20 -u test/data/sample_reads.fastq.gz aln.sam test/data/sample_layout.fasta.gz > out.fasta
A couple of questions:
-q 20
option from Racon?Best regards, Ivan.
Hi Ivan! First of all thank you for your reply! I ran the command you wrote but still it gave the same results.
I copied a part of it so you can see, it looks different than SAM file I generated with BWA (short reads) for example. Does the format I pasted look normal for long read alignment?
I think its not a problem of memory, or else it would stop running no? Maybe I'm mistaking (once again I'm very new to this). Thank you for your patience and help!! Sincerely, Rachel
Hi Rachel!
Thanks for getting back! Can you also copy/paste the output of Racon from your terminal if you still have it, so I can see where it is hanging? Did it reach the step with the sliding arrow that says "generating consensus"?
The SAM file you sent looks normal, thanks for sharing it!
Racon will try to load all reads into memory at the beginning of the process. Your FASTQ file is ~64GB large and the input SAM file is ~114GB in size. How much memory does your machine have available? If your machine doesn't have this much RAM (actually, somewhat more than this because of data structures and algorithms), the process could get killed by the system, or it could go into swap and become extremely laggy to the point that it looks frozen. In this case, the solution is to separate contigs from your reference file into multiple files, and extract only the alignments from the SAM file which map to each contig group, and then run them separately. (This can be scripted.)
Still, this does not explain why the small test example doesn't produce any output for you.
Can you rerun it with /usr/bin/time
so we can record the exit status? For example, like this:
minimap2 -ax map-pb -t 20 --cs=long test/data/sample_layout.fasta.gz test/data/sample_reads.fastq.gz > aln.sam
/usr/bin/time --format="cmd: %C\\nreal_time: %e s\\nuser_time: %U s\\nsys_time: %S s\\nmax_rss: %M kB\\nexit_status: %x" -o out.memtime \
racon -m 8 -x -6 -g -8 -w 500 -t 14 -q 20 -u test/data/sample_reads.fastq.gz aln.sam test/data/sample_layout.fasta.gz > out.fasta
After this, please copy the out.memtime
contents and the stderr output from your terminal (the output from Racon).
Best regards, Ivan.
Hi Ivan! Sadly I could not run it, and then we had a problem with the linux cluster at our university so finally I managed to run it on "galaxy" website and it gave me a great output! Thank you so much for your help!!! Rachel
Glad to hear it's resolved!
Best regards, Ivan.
Hi, I am very new to genome assembly, I have an assembly I created with NextDenovo with PacBio data. I used minimap2 (the command I used: minimap2 -ax map-pb -t 20 --cs=long /dorotheeh/hadjez/nd24.asm.fasta /dorotheeh/hadjez/FilteredResults_Q20.fastq > pb_alignment_minimap2.sam) Now I am trying to run Racon but the output file I get is empty, this is my command: racon -m 8 -x -6 -g -8 -w 500 -t 14 -q 20 -u /dorotheeh/hadjez/FilteredResults_Q20.fastq /dorotheeh/hadjez/pb_alignment_minimap2.sam /dorotheeh/hadjez/nd24.asm.fasta > nd_Racon.fasta
Can you help me understand what I'm doing wrong please? Thank you in advance, Rachel