marbl / MetagenomeScope

Visualization tool for (meta)genome assembly graphs
https://marbl.github.io/MetagenomeScope/
GNU General Public License v3.0
24 stars 8 forks source link

Support general node / edge metadata #243

Open fedarko opened 1 year ago

fedarko commented 1 year ago

Arbitrary categorical / quantitative information associated with each node / edge. Something simple like TSV files would be sufficient. Maybe we have two optional parameters to the CLI (--node-metadata and --edge-metadata?), where each can refer to a TSV file where the rows are nodes / edges and the columns are metadata fields (e.g. TaxonomicClassification, Repetitive, ...)

The main purposes behind supporting this are:

  1. Enabling the user to encode the visualization with this information -- i.e. node / edge colors, or edge thicknesses, or maybe node border colors (?), etc.

  2. When we view information about a node / edge, show this information (along with existing node / edge properties, e.g. length, GC content, ...)

This information should probably just be stored in the same location as arbitrary GFA tag information (#238). We will need to do the standard sanity checking of "is the same metadata field name given in both files?", of course.

Support for this issue would supersede #8, #54, and #95. It isn't quite a replacement for #111, but I guess we could extend it to support #111 (I'll leave both this and #111 open for now).

Checklist

Python

JS