lh3 / ropebwt2

Incremental construction of FM-index for DNA sequences
MIT License
68 stars, 5 forks

The basic command outputs a wrong result. #8

Closed (akp6031 closed 1 year ago)

akp6031 commented 1 year ago

For the attached file, SRR20044276.part_001.part_001.fastq.gz, the symbol counts are: G (51374), C (35983), A (33513), T (32130).

Running the command below prints an odd result: A and T share one count, C and G share another, and neither matches the input. The output file is consistent with the printed result.

./ropebwt2 -o out.bwt SRR20044276.part_001.part_001.fastq 

symbol counts: ($, A, C, G, T, N) = (2000, 64643, 86357, 86357, 64643, 0)

I get similar results for every input. Could you advise on the correct usage?

lh3 commented 1 year ago

-R
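The reply presumably points to the `-R` option, which, as far as I understand ropebwt2's usage message, skips the reverse strand. If the default run indexes each read together with its reverse complement, then every A on one strand is matched by a T on the other (and likewise for C and G), which would explain the symmetric counts above. Below is a minimal Python sketch of that effect. The function names, the `add_reverse` parameter, and the toy reads are hypothetical illustrations, not part of ropebwt2 and not taken from the attached file.

```python
from collections import Counter

# Complement table for DNA bases; N maps to itself.
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def symbol_counts(reads, add_reverse=True):
    """Count bases over all reads.

    add_reverse=True is an assumption that mimics indexing both strands;
    add_reverse=False mimics what running with -R is presumed to do.
    """
    counts = Counter()
    for read in reads:
        counts.update(read)
        if add_reverse:
            counts.update(reverse_complement(read))
    return counts


# Hypothetical toy reads, chosen so that A != T and C != G on the forward strand.
reads = ["GGGCA", "GTTTG"]

forward_only = symbol_counts(reads, add_reverse=False)
both_strands = symbol_counts(reads, add_reverse=True)

print("forward only:", dict(forward_only))
print("both strands:", dict(both_strands))
```

With both strands counted, A and T each equal the forward A + T total, and C and G each equal the forward C + G total. On the real data, rerunning the original command with the flag added, `./ropebwt2 -R -o out.bwt SRR20044276.part_001.part_001.fastq`, should then report counts for the forward strand only.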