lh3 / seqtk

Toolkit for processing sequences in FASTA/Q formats
MIT License
1.35k stars 310 forks source link

subseq empty output #189

Open SingleCellCRM-user opened 2 years ago

SingleCellCRM-user commented 2 years ago

I am having a similar issue as described in #145 but with subseq. Filtering the fastq file with a small name.lst works fine (under 10M reads,1GB), but when I increase the name.lst file (around 500M reads, 15GB) the output is empty. I thought it could be a problem of running out of memory, as suggested in #145, however I am in a computer cluster providing 16GB of memory for this job, and the maxvmem used is only 13GB. Is this a maximum from seqtk itself?

My plan to carry on is to divide the name.lst into smaller chunks, and then merge the output file. (should be the same shouldn't it?)