flatironinstitute / deepblast

Neural Networks for Protein Sequence Alignment
BSD 3-Clause "New" or "Revised" License
109 stars 19 forks source link

Running time for large query sequences #143

Closed Citugulia40 closed 11 months ago

Citugulia40 commented 11 months ago

Hi,

Thanks for developing this tool.

I have 250 seed sequences and I want to search 2 million sequences against these. So, I have built the database of 250 sequences using tmvec-build and now I am searching the query proteins against this database using tmvec-search. It is running from 10 days and the output file "tabular.txt" will be made in the end. Is there any way that I can see, how many sequences from 2 million are able to run till now and how much time will it take to complete the search?

Thanks in advance.

mortonjt commented 11 months ago

Moving over to the tmvec page