shenwei356 / seqkit

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation
https://bioinf.shenwei.me/seqkit
MIT License

[Feature suggestion][amplicon] primer-pair searching limit #349

Open Subject-13 opened 2 years ago

Subject-13 commented 2 years ago

Hello!

I might have an idea for your tool:

Describe your issue

It might be useful to collect only a fixed number of hits for each primer pair in the given table, and then remove that pair from the list so that it is no longer searched for in the remaining reads.

For example, once I have 1,000 hits for a particular primer pair, there is no need to keep searching for that pair. The search would continue through the reads until every pair has reached its limit or all reads have been searched.
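To make the idea concrete, here is a minimal Python sketch of a per-pair hit limit. It is not how seqkit works internally: the names `search_with_limit` and `reverse_complement` are hypothetical, matching is exact substring matching without mismatches or degenerate bases, and only the forward strand of each read is checked.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a plain A/C/G/T sequence."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


def search_with_limit(reads, primer_pairs, limit):
    """Collect at most `limit` amplicons per primer pair (hypothetical helper).

    reads:        iterable of (read_id, sequence) tuples
    primer_pairs: dict mapping pair name -> (forward_primer, reverse_primer)
    limit:        maximum number of hits to keep per pair
    """
    # On the forward strand, the reverse primer appears as its reverse complement.
    active = {
        name: (fwd, reverse_complement(rev))
        for name, (fwd, rev) in primer_pairs.items()
    }
    hits = {name: [] for name in primer_pairs}

    for read_id, seq in reads:
        # Iterate over a copy of the keys, because pairs are removed in the loop.
        for name in list(active):
            fwd, rev_rc = active[name]
            start = seq.find(fwd)
            if start == -1:
                continue
            end = seq.find(rev_rc, start + len(fwd))
            if end == -1:
                continue
            hits[name].append((read_id, seq[start:end + len(rev_rc)]))
            if len(hits[name]) >= limit:
                # Limit reached: stop searching for this pair.
                del active[name]
        if not active:
            # Every pair is saturated, so the remaining reads can be skipped.
            break
    return hits
```

Calling `search_with_limit(reads, pairs, 1000)` would return a dictionary of hits per pair and stop reading as soon as every pair has 1,000 hits. The time saving comes from two places: saturated pairs are no longer matched against each read, and the read stream is abandoned early once no active pairs remain.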

shenwei356 commented 2 years ago

It does reduce search time when using many primers, but most of the time, we need to search all reads.

Subject-13 commented 2 years ago

We could use this in our workflow: the nanopore sequencer produces a lot of data, but beyond a certain amount it becomes redundant for us and only costs time.

I will try to implement this change myself and can report back on whether it is stable.