NBISweden / Earth-Biogenome-Project-pilot

Assembly and Annotation workflows for analysing data in the Earth Biogenome Project pilot project.
https://www.earthbiogenome.org/
GNU General Public License v3.0

Custom classes to properly maintain metadata #53

Closed mahesh-panchal closed 7 months ago

mahesh-panchal commented 9 months ago

This is a suggestion that may not be feasible, but using custom objects might make handling metadata easier.

The issue is that meta map manipulation is a common feature of workflows, because they follow the nf-core way of passing metadata around. This often means fields are added to and removed from meta maps just to get the correct set of fields to join on.
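To illustrate the joining problem, here is a minimal language-agnostic sketch in Python rather than Nextflow. The field names (`id`, `read_type`, `assembler`), the file names, and the `join_on` helper are all hypothetical and only stand in for whatever a real workflow carries in its meta maps.

```python
# Hypothetical (meta, file) pairs, as two channels might emit them.
reads = [({"id": "sampleA", "read_type": "hifi"}, "sampleA.fastq.gz")]
assemblies = [({"id": "sampleA", "assembler": "hifiasm"}, "sampleA.fasta")]

# Joining on the whole meta map finds nothing, because the maps differ.
naive_matches = [
    (read_file, asm_file)
    for read_meta, read_file in reads
    for asm_meta, asm_file in assemblies
    if read_meta == asm_meta
]


def join_on(left, right, fields):
    """Join two lists of (meta, payload) pairs on a subset of meta fields."""
    def key(meta):
        return tuple(meta[f] for f in fields)

    index = {key(meta): (meta, payload) for meta, payload in right}
    joined = []
    for meta, payload in left:
        match = index.get(key(meta))
        if match is not None:
            right_meta, right_payload = match
            # Merge both maps so no metadata is lost after the join.
            joined.append(({**meta, **right_meta}, payload, right_payload))
    return joined


# The maps have to be reduced to the shared field before they match.
joined = join_on(reads, assemblies, ["id"])
```

Every join in a workflow needs a reduction step of this kind, which is the repeated manipulation described above.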

Being able to use custom classes might improve metadata handling since, for the most part, these are simply data stores. The primary issue, though, is handling file staging. This may be possible by extending Nextflow's ArrayBag class.
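As a rough sketch of the "data store" idea, written in Python for illustration only: the class names follow the potential objects listed in this issue, while every field (`id`, `ploidy`, `platform`, `assembler`, `path`) is an assumption. It does not address file staging, and it is not how Nextflow or ArrayBag work.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass(frozen=True)
class Sample:
    id: str
    # Hypothetical extra field, excluded from equality and hashing
    # so that it never affects a join.
    ploidy: Optional[int] = field(default=None, compare=False)


@dataclass(frozen=True)
class ReadData:
    sample: Sample
    platform: str
    path: str


@dataclass(frozen=True)
class Assembly:
    sample: Sample
    assembler: str
    path: str


reads = [ReadData(Sample("sampleA", ploidy=2), "hifi", "sampleA.fastq.gz")]
assemblies = [Assembly(Sample("sampleA"), "hifiasm", "sampleA.fasta")]

# The join key is the Sample object itself; no fields are stripped first.
by_sample = {assembly.sample: assembly for assembly in assemblies}
pairs = [(r, by_sample[r.sample]) for r in reads if r.sample in by_sample]
```

The design choice here is that each class decides once which fields define identity, so the join key no longer has to be rebuilt by hand at every step.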

It's possible to use custom classes (see https://github.com/mahesh-panchal/nxf-custom-object-test), at least as input, and to perform operations on them.

Potential metadata objects

Sample:

?

Read data:

Assembly:

mahesh-panchal commented 7 months ago

Processes are not designed to take the output