wandreopoulos / deeplasmid


Hits to plasmid proteins #10

Open vanhoan310 opened 7 months ago

vanhoan310 commented 7 months ago

Dear authors,

I wonder how you define and compute a hit to a database (the Boolean variable), e.g., plasmid_proteins?

Thanks

wandreopoulos commented 7 months ago

Hello, it uses MinHash to define hits to databases. Specifically, the sketch tool from BBTools is used to MinHash a database and then map the k-mers against it quickly. See: https://www.seqanswers.com/forum/bioinformatics/bioinformatics-aa/60935-minhash-sketch-a-tool-for-rapid-sequence-comparison
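To make the idea concrete, here is a minimal Python sketch of how a MinHash-based Boolean hit could be computed. It is an illustration only, not the BBTools sketch implementation and not deeplasmid's actual code: the k-mer length, sketch size, `min_shared` cutoff, hash function, function names, and the toy sequence are all assumptions chosen for the example.

```python
import hashlib

def kmers(seq, k):
    """Yield every overlapping k-mer of seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]

def hash_kmer(kmer):
    """Stable 64-bit hash of a k-mer (blake2b is an illustrative choice)."""
    digest = hashlib.blake2b(kmer.encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")

def sketch(seq, k=5, size=64):
    """Bottom-k MinHash sketch: the `size` smallest distinct k-mer hashes."""
    hashes = {hash_kmer(km) for km in kmers(seq, k)}
    return set(sorted(hashes)[:size])

def is_hit(query, db_sketches, k=5, size=64, min_shared=3):
    """Return (hit, best_shared): hit is True if the query sketch shares at
    least `min_shared` hashes with any reference sketch (assumed cutoff)."""
    q = sketch(query, k, size)
    best_shared = max((len(q & ref) for ref in db_sketches.values()), default=0)
    return best_shared >= min_shared, best_shared

# Hypothetical toy "database" with one made-up protein sequence.
reference = "MKKLLPTAAAGLLLLAAQPAMAMTEYKLVVVGAGGVGKS"
db_sketches = {"toy_plasmid_protein": sketch(reference)}

fragment = reference[5:25]       # query that overlaps the reference
unrelated = "W" * 20             # query sharing no k-mers with the reference

hit_fragment, shared_fragment = is_hit(fragment, db_sketches)
hit_unrelated, shared_unrelated = is_hit(unrelated, db_sketches)
print(hit_fragment, shared_fragment)
print(hit_unrelated, shared_unrelated)
```

In this toy setup the sequences are shorter than the sketch size, so each sketch holds every k-mer hash; with real databases the sketch is a small sample of the hashes, and the shared count is only an estimate of the true k-mer overlap.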

On Wed, Dec 6, 2023 at 12:22 AM vanhoan310 wrote:

Dear authors,

I wonder how you define and compute a hit to a database (the Boolean variable), e.g., plasmid_proteins?

Thanks


Thanks,
Bill

William B. Andreopoulos, Ph.D.
Joint Genome Institute, LBNL