Open LongTianPy opened 5 years ago
Hi, currently we don't support saving the index on disk, but it's in my TODO list. BTW, hope you are using multi-threaded execution. But, no doubt that indexing would take majority of the time.
Thanks for the reply. But I was looking for saving index of each reference genome as a physical file.
I support this feature in the future! Would love a full index of refseq for easy querying...
Came here to say I would use this feature as well!
Maybe one day... !
Did this ever end up getting implemented?
No Sorry I don't have cycles for this.
watching this thread, having several hundreds MAGs to compare to ~30k genomes, saving the reference index would save an unimaginable amount of time. Cheers !
Hi sir,
While I was querying one genome against ~2000 genome, it was very slow. I checked back the paper on 90k prokaryotic genomes and found indexing would take the majority of the runtime, so I wonder if minimizer of each genome can be saved (like sketch or signature of MinHash) and doesn't have to be recreated every time?