vgteam / vg

tools for working with genome variation graphs

Unified parallel GBWT construction #4221

Closed jltsiren closed 4 months ago

jltsiren commented 4 months ago

Changelog Entry

To be copied to the draft changelog by merger:

Description

This PR updates GBWTGraph to include unified support for multithreaded GBWT construction (see jltsiren/gbwtgraph#36). The main idea is to:

1. Partition the graph into GBWT construction jobs using gbwt_construction_jobs().
2. Create the final metadata with MetadataBuilder.
3. Pass reference paths with assign_paths() and insert_paths().
4. Build partial GBWTs for the jobs in parallel.
5. Merge the GBWTs.
6. Add metadata from MetadataBuilder.
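
For readers unfamiliar with the pattern, here is a minimal sketch of the partition, build-in-parallel, merge flow described above. It does not use the GBWTGraph API: `Component`, `Job`, `PartialIndex`, `partition_into_jobs()`, `build_partial()`, and `build_in_parallel()` are hypothetical stand-ins, and the first-fit packing by size is an assumption for illustration, not a description of how gbwt_construction_jobs() actually partitions the graph. Metadata handling is left out.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <string>
#include <thread>
#include <vector>

// Hypothetical stand-ins; none of these are gbwtgraph types.
struct Component { std::size_t id; std::size_t size; };  // e.g. one graph component
struct Job {
    std::vector<std::size_t> components;  // ids of the components in this job
    std::size_t total_size = 0;           // sum of their sizes
};
using PartialIndex = std::vector<std::string>;  // stand-in for a partial GBWT

// Pack components into jobs of at most target_size each (first fit, largest first).
// A component larger than target_size gets a job of its own.
std::vector<Job> partition_into_jobs(std::vector<Component> components, std::size_t target_size) {
    assert(target_size > 0);
    std::sort(components.begin(), components.end(),
              [](const Component& a, const Component& b) {
                  return a.size != b.size ? a.size > b.size : a.id < b.id;
              });
    std::vector<Job> jobs;
    for (const Component& c : components) {
        auto it = std::find_if(jobs.begin(), jobs.end(), [&](const Job& j) {
            return j.total_size + c.size <= target_size;
        });
        if (it == jobs.end()) {
            jobs.emplace_back();
            it = jobs.end() - 1;
        }
        it->components.push_back(c.id);
        it->total_size += c.size;
    }
    return jobs;
}

// Build the partial result for one job. A real builder would index paths here.
PartialIndex build_partial(const Job& job) {
    PartialIndex result;
    for (std::size_t id : job.components) {
        result.push_back("component_" + std::to_string(id));
    }
    return result;
}

// Run one thread per job, then merge the partial results in job order
// so that the output does not depend on thread scheduling.
std::vector<std::string> build_in_parallel(const std::vector<Job>& jobs) {
    std::vector<PartialIndex> partials(jobs.size());
    std::vector<std::thread> workers;
    workers.reserve(jobs.size());
    for (std::size_t i = 0; i < jobs.size(); i++) {
        // Each thread writes only to its own slot, so no locking is needed.
        workers.emplace_back([&jobs, &partials, i]() {
            partials[i] = build_partial(jobs[i]);
        });
    }
    for (std::thread& t : workers) { t.join(); }

    std::vector<std::string> merged;
    for (const PartialIndex& p : partials) {
        merged.insert(merged.end(), p.begin(), p.end());
    }
    return merged;
}
```

Calling `build_in_parallel(partition_into_jobs(components, target_size))` runs the whole flow. Writing each partial result into a preallocated slot and merging in job order keeps the merged output deterministic regardless of which thread finishes first.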