Closed shenweima closed 3 years ago
I'm sorry, currently this App is in Beta and still under development. Obviously I've found that your NCBI database is splitted into multiple sub files and I haven't dealt with this bug yet. I'll fix it in future. Currently you could try BLAST database in non-split format such as:
CS_genome_v2.0
CS_genome_v2.0.nhr
CS_genome_v2.0.nin
CS_genome_v2.0.nsq
I will be interested too in this updated. I think you can just modify the run_blast.py script indeed. In a local copy I just masked lines 39-39 of the script. Maybe the statement could be made more flexible and include also db.*.nhr ?
@aariani Thank you for your suggestion. However the main problem is that if we run blast (v2.9.0) like this:
blastn -query query.fa -db CS_genome_v2.0
,
it would automatically delete CS_genome_v2.0.nal
and instead re-build CS_genome_v2.0.nhr, CS_genome_v2.0.nin, CS_genome_v2.0.nsq
, which seems rather confusing and time-wasting!
The thing is that the wheat genome is too big and BLAST automatically creates split genome database regardless. Otherwise the search will not be efficient with a single blastdb.
The wheat genome is huge, pretty much a single chromosome is the size of a regular plant genome actually
Well, do you mean such split databases were automatically created by the makeblastdb
commands? I don't have wheat genome now, and I only test very small datasets. When I run blastn
, it automatically deletes the nal
file and again re-build the normal database. Strange.
@shenweima @aariani This issue has been fixed now
Web UI still needs to be fixed
NCBI database
CS_genome_v2.0 is genome fasta file.