lh3 / seqtk

Toolkit for processing sequences in FASTA/Q formats
MIT License
1.35k stars 310 forks source link

support >2G sequences #192

Closed c-zhou closed 1 year ago

c-zhou commented 2 years ago

Hi Heng,

The major changes are (1) return types of ks_getuntil2 and kseq_read in kseq.h - from int to int64_t, and (2) the definition of uint64 *a in reglist_t struct in seqtk.c - from beg<<32|end to (beg,end) pairs in (2n,2n+1) positions.

Best, Chenxi

lh3 commented 1 year ago

Thanks for this, Chenxi! I have merged the PR and will use it more in my daily work. Your change will be there in the next release.