cucapra / pollen

generating hardware accelerators for pangenomic graph queries

FlatGFA: Hand-rolled GFA parser #154

sampsyo closed 6 months ago

sampsyo commented 6 months ago

The next bottleneck in GFA parsing was the external rs-gfa library, so I replaced it with a hand-rolled parser.
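For readers unfamiliar with what a hand-rolled GFA parser looks like, here is a minimal sketch. It is not the code from this PR: the names `Line`, `next_field`, `orient`, and `parse_line` are hypothetical, and it assumes tab-delimited GFA1 input with only `H`, `S`, `L`, and `P` lines. The idea is to slice fields directly out of the input bytes rather than building owned strings for every record.

```rust
/// One parsed GFA1 line. Every field borrows from the input buffer,
/// so parsing a line allocates nothing. (Hypothetical type, for illustration.)
#[derive(Debug, PartialEq)]
pub enum Line<'a> {
    Header(&'a [u8]),
    Segment { name: &'a [u8], seq: &'a [u8] },
    Link {
        from: &'a [u8],
        from_rev: bool,
        to: &'a [u8],
        to_rev: bool,
        overlap: &'a [u8],
    },
    Path { name: &'a [u8], steps: &'a [u8] },
}

/// Split off the next tab-delimited field, returning (field, rest).
/// If there is no tab left, the whole buffer is the field and rest is empty.
fn next_field(buf: &[u8]) -> (&[u8], &[u8]) {
    match buf.iter().position(|&b| b == b'\t') {
        Some(i) => (&buf[..i], &buf[i + 1..]),
        None => (buf, &buf[buf.len()..]),
    }
}

/// Decode an orientation field: "+" is forward, "-" is reverse.
fn orient(field: &[u8]) -> Option<bool> {
    match field {
        b"+" => Some(false),
        b"-" => Some(true),
        _ => None,
    }
}

/// Parse a single line (without its trailing newline).
/// Returns None for unknown line types or malformed orientations.
/// This sketch is lenient: a missing trailing field parses as empty.
pub fn parse_line(line: &[u8]) -> Option<Line<'_>> {
    let (kind, rest) = next_field(line);
    match kind {
        b"H" => Some(Line::Header(rest)),
        b"S" => {
            let (name, rest) = next_field(rest);
            let (seq, _tags) = next_field(rest);
            Some(Line::Segment { name, seq })
        }
        b"L" => {
            let (from, rest) = next_field(rest);
            let (from_o, rest) = next_field(rest);
            let (to, rest) = next_field(rest);
            let (to_o, rest) = next_field(rest);
            let (overlap, _tags) = next_field(rest);
            Some(Line::Link {
                from,
                from_rev: orient(from_o)?,
                to,
                to_rev: orient(to_o)?,
                overlap,
            })
        }
        b"P" => {
            let (name, rest) = next_field(rest);
            let (steps, _overlaps) = next_field(rest);
            Some(Line::Path { name, steps })
        }
        _ => None,
    }
}
```

A caller would typically split the whole file buffer on `b'\n'` (for example with `buf.split(|&b| b == b'\n')`) and pass each piece to `parse_line`. Working on byte slices skips UTF-8 validation and per-field allocation, which is the usual motivation for hand-rolling a parser for a simple line-oriented format.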

Using the same measurement setup as #153:

| | chr22 | chr8 |
|---|---|---|
| originally | 28s | 49s |
| after #153 | 13s | 18s |
| after this PR | 7s | 12s |

So that's another 1.9x (chr22) and 1.5x (chr8) speedup over the last set of optimizations, for a total of about 4x speedup over the first version.

It's clear that the bottleneck now is in the memcpying to the destination files, which is also avoidable (with some compromises).