Closed elcortegano closed 1 year ago
AlignGraph2 can skip the completed processing for which the result already present in the working direcotry. Restart the program will restore from the last point. The long process time may be caused by the too many k-mer positons in extreme situation, and this is where the program need to be optimzed. You can try running from beginning with sampled reads, taking 2/3 or half of the total reads.
Thank you @huangs001 , that was very helpful,
I've got now some output files. I imagine that the extended genome file is final.fasta
, but nowhere in the documentation I find what are the other files generated (e.g. remainder.fasta
). What do these files refer to? I'm a bit confused, since the final.fasta
file only differs in that a few contigs have been removed, but no contig has been extended.
I'm sorry that I've been busy recently and haven't received the issue notification. The final.fasta
is equivalent to merging add.fasta
and remainder.fasta
, while add.fasta
includes the extended contigs and remainder.fasta
includes the contigs that were not extended. And the extended contigs are joined by different original contigs, the connect_info.txt
shows how to join.
Thank you for the clarifications!
I'm running AlignGraph2 for a mouse genome (~2.7 Gb) with assembly and input files near 1 GB in size, and high read coverage (90 GB of data). I'm running this tool in two different clusters, and in both of them, the last file modified is one of:
with different
X
number. The size of the working directory is also very different. However, both runs have been stuck now for several days. The commandtop
reveals that AlignGraph2 is runningpagraph
. This is whatps aux
shows about the process run:For this run, the last lines in the log file are:
This is the last line of log printed in four days. Is it normal for pagraph / aligngraph2 to take this long to run?
Could it be stuck? if so, is it safe to kill the process and restart it? would it restart from the begging or does aligngraph2 skip remaking files already present in the working directory and restore execution from the point where it was?