pmelsted / bifrost

Bifrost: Highly parallel construction and indexing of colored and compacted de Bruijn graphs
BSD 2-Clause "Simplified" License

GFA output: DA field meaning #37

Closed: tillbaar closed this issue 3 years ago

tillbaar commented 3 years ago

In the GFA output, all segments carry an optional field with the tag DA of type Z (printable string incl. whitespace). What is the meaning of this field? At least in the output files I generated so far, this field always contained an integer >0.

I hope I didn't just overlook the explanation somewhere in the documentation. Also, I am blown away by how fast Bifrost is. Awesome work!

Thanks, Till

GuillaumeHolley commented 3 years ago

Hi @tillbaar,

The optional field "DA" stands for "Data Access": it is used by Bifrost to associate unitigs with their color classes when reading a colored de Bruijn graph from disk. Hence, it is only output if a colored de Bruijn graph is created.

Thank you for your kind words, let me know if you have any questions or suggestions about Bifrost.

Guillaume
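
For anyone who wants to inspect the DA values in their own files, here is a minimal sketch in Python that collects the `DA:Z:` field from each segment (`S`) line of a GFA file. It relies only on the general GFA layout (tab-separated columns, optional fields written as `TAG:TYPE:VALUE` after the sequence). The function name `read_da_fields` and the example lines are made up for illustration and are not taken from real Bifrost output; the value is kept as a string because the thread does not specify how Bifrost interprets it internally.

```python
def read_da_fields(gfa_lines):
    """Return {segment name: DA value as a string} for GFA segment lines.

    Hypothetical helper for illustration; it is not part of Bifrost.
    """
    da_by_segment = {}
    for line in gfa_lines:
        fields = line.rstrip("\n").split("\t")
        # Segment lines look like: S <name> <sequence> [optional fields...]
        if len(fields) < 3 or fields[0] != "S":
            continue
        name = fields[1]
        for opt in fields[3:]:
            parts = opt.split(":", 2)  # TAG:TYPE:VALUE
            if len(parts) == 3 and parts[0] == "DA" and parts[1] == "Z":
                da_by_segment[name] = parts[2]
    return da_by_segment


# Made-up example lines (assumption: not real Bifrost output).
example = [
    "H\tVN:Z:1.0",
    "S\t1\tACGTACGT\tDA:Z:1",
    "S\t2\tTTGCA\tDA:Z:2",
    "S\t3\tGGGCC",            # segment without a DA field
    "L\t1\t+\t2\t-\t4M",
]

print(read_da_fields(example))
```

To run it on a real file, pass an open file handle instead of the list, for example `read_da_fields(open("graph.gfa"))`, where `graph.gfa` is a placeholder path. Segments without a DA field are simply absent from the result.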