ebioman closed this issue 7 years ago
Hi,
you are right, NCBI recently restructured its FTP sites, and the download links in the example no longer work. I will try to fix that, but unfortunately I don't have the time right now.
As for the seg-fault, that is an issue with the size of your genome and the way I currently run minimap. Minimap cannot handle sequences >2Gbp or so, and I internally concatenate all contigs of a genome into a single sequence, in your case some 4Gbp. I have plans to change that behaviour but, again, no time to implement them at the moment. Sorry.
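To check up front whether a genome would run into this, one could sum the contig lengths and compare the total against the limit. The sketch below is only an illustration and not how the tool works internally: the exact threshold is an assumption (the figure above is just ">2Gbp or so", and `2**31 - 1` is a guess based on a signed 32-bit coordinate), and the helpers `contig_lengths` and `batch_contigs` are hypothetical names. It also shows one possible way to group contigs into batches that each stay under the limit instead of concatenating everything.

```python
from typing import Dict, Iterable, List

# Assumed limit: the thread only says ">2Gbp or so". 2**31 - 1 is a guess
# (signed 32-bit coordinate), not a documented minimap constant.
ASSUMED_MAX_BP = 2**31 - 1


def contig_lengths(fasta_lines: Iterable[str]) -> Dict[str, int]:
    """Return {contig_name: length_in_bp} from the lines of a FASTA file."""
    lengths: Dict[str, int] = {}
    name = None
    for line in fasta_lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the name.
            parts = line[1:].split()
            name = parts[0] if parts else ""
            lengths[name] = 0
        elif name is not None:
            lengths[name] += len(line)
    return lengths


def batch_contigs(lengths: Dict[str, int],
                  max_bp: int = ASSUMED_MAX_BP) -> List[List[str]]:
    """Greedily group contigs, in input order, so each batch totals <= max_bp.

    A single contig longer than max_bp still ends up alone in its own batch;
    this sketch does not split individual sequences.
    """
    batches: List[List[str]] = []
    current: List[str] = []
    current_bp = 0
    for name, length in lengths.items():
        if current and current_bp + length > max_bp:
            batches.append(current)
            current, current_bp = [], 0
        current.append(name)
        current_bp += length
    if current:
        batches.append(current)
    return batches
```

With a real file this could be used as `lengths = contig_lengths(open("genome.fa"))` (the file name is a placeholder), followed by `sum(lengths.values()) > ASSUMED_MAX_BP` to see whether the concatenated genome exceeds the assumed limit.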
Hi, thanks for the quick reply. I had already suspected something related to the size of the genome. This is indeed a bummer, as your approach looked very interesting and could have replaced mummer for many quick analyses. Cheers
Have a look at this repo https://github.com/zeeev/minimap, and the extension described in the example https://github.com/zeeev/minimap#running-example-gorilla-vs-grch38. This might also work for large genomes.
Hi, thanks for pointing me to that alternative method. It does work in general, but oddly it reports (calculates) genome sizes that are many times larger than the real ones. Cheers
Hello, the examples you provide in the Makefile are not working anymore. I think this is related to Ensembl no longer offering the entire genome as one file (or I could not find it).
This is a bit troublesome, as I got another error when testing it on my own data and could not tell whether it was caused by my local installation or by my data.