OrderMinHash bug fix from updating to sketch v0.19.1
Throw an error on empty sequences.
Improved handling of ram or memory sequences.
Since --parse-by-seq only needs sequences for edit distance calculation, we can free memory if running in --seqs-in-ram mode. Saves the trouble of caching the parsed sequences to disk, but requires more memory.
Lifetime management needed a bit of extra work, but it seems to be stable for both cases.
Since --parse-by-seq only needs sequences for edit distance calculation, we can free memory if running in --seqs-in-ram mode. Saves the trouble of caching the parsed sequences to disk, but requires more memory.
Lifetime management needed a bit of extra work, but it seems to be stable for both cases.