ComparativeGenomicsToolkit / cactus

Official home of genome aligner based upon notion of Cactus graphs
Other
511 stars 112 forks source link

cactus-pangenome error : OSError: [Errno 7] Argument list too long: 'cactus_consolidated' #1276

Closed tanger-code closed 8 months ago

tanger-code commented 8 months ago

Hi.

I'm creating a pan-genome graph of thousands of bacterial genomes using command : cactus-pangenome ./1668-js-9 ./seqfile.txt --outName KP --outDir 1668-kp-9 --reference GN202435 --giraffe --vcf --viz --odgi --gbz --gfa --vcf --logFile 1668.log --workDir ./workDir-1668 --mgCores 80 --mapCores 80 --consCores 80 --indexCores 80 2> 1668.stderr. But there is an error in the last: image What might be the cause of this error? Is it because I use too much genome( about 1500)?

glennhickey commented 8 months ago

There are some known issues with regards to scaling about about 500 genomes. We'll need to get this fixed, as we'll soon have more than that many human genomes in the HPRC, but I don't have a timeline yet. For now you'll have to subsample or use a different tool.

tanger-code commented 8 months ago

There are some known issues with regards to scaling about about 500 genomes. We'll need to get this fixed, as we'll soon have more than that many human genomes in the HPRC, but I don't have a timeline yet. For now you'll have to subsample or use a different tool.

Okay, thank you. I will try to subsample them later.

fpichon commented 8 months ago

Same error... Is this linked to getconf ARG_MAX and is there a work around using linux config ?

EDIT: could it be avoided using find ? https://www.gnu.org/software/coreutils/faq/coreutils-faq.html#Argument-list-too-long