Hello, Yuansheng Liu.
Our group is developing a sequence aligner which is faster on reordered reads. We have tested three reordering-based compressors, SPRING, Minicom and PgRC. But Minicom always reports segment fault on large human dataset, for example ERP001775_1 size of 217Gbp. It is available from the links down below. The coverages of subsets constituted of files is from 7.2X to 34.6X.
The testing machine is equipped with 96 Intel(R) Xeon(R) CPU E7-4830 v3 @ 2.10GHz processors and 1T RAM. Gcc (version 6.4.0) is used to compile Minicom on operation system CentOS release 6.6. Minicom works well on low coverage. But it fails on the high coverage (ERP001775_1.fq constituted by 5 sub files above).
Hello, Yuansheng Liu. Our group is developing a sequence aligner which is faster on reordered reads. We have tested three reordering-based compressors, SPRING, Minicom and PgRC. But Minicom always reports segment fault on large human dataset, for example ERP001775_1 size of 217Gbp. It is available from the links down below. The coverages of subsets constituted of files is from 7.2X to 34.6X.
The testing machine is equipped with 96 Intel(R) Xeon(R) CPU E7-4830 v3 @ 2.10GHz processors and 1T RAM. Gcc (version 6.4.0) is used to compile Minicom on operation system CentOS release 6.6. Minicom works well on low coverage. But it fails on the high coverage (ERP001775_1.fq constituted by 5 sub files above).
We are appreciated if you fix the problem and help us complete the experiment results.
Thank you. Best regards. i-xiaohu (HIT-CS)