marbl / meryl

A genomic k-mer counter (and sequence utility) with nice features.
119 stars 14 forks source link

Support for printing top x % of most frequent k-mers #14

Open cjain7 opened 4 years ago

cjain7 commented 4 years ago

There is already an optiongreater-than N for printing kmers that occur more than N times in the input. It would also be nice to have a similar option most-frequent x where x ranges from (0,1]. This option would return top x fraction of the most repetitive kmers. Similarly least-frequent x may also be useful.