flatironinstitute / deepblast

Neural Networks for Protein Sequence Alignment
BSD 3-Clause "New" or "Revised" License
114 stars 21 forks source link

Running time for large query sequences #143

Closed Citugulia40 closed 1 year ago

Citugulia40 commented 1 year ago

Hi,

Thanks for developing this tool.

I have 250 seed sequences and I want to search 2 million sequences against these. So, I have built the database of 250 sequences using tmvec-build and now I am searching the query proteins against this database using tmvec-search. It is running from 10 days and the output file "tabular.txt" will be made in the end. Is there any way that I can see, how many sequences from 2 million are able to run till now and how much time will it take to complete the search?

Thanks in advance.

mortonjt commented 1 year ago

Moving over to the tmvec page