tsdataclinic / smooshr

Tool to consolidate entries and columns from multiple datasets
https://tsdataclinic.github.io/smooshr/
Apache License 2.0
14 stars 4 forks source link

Investigate different models for describing an analysis flow using a DAG or similar structure. #83

Open stuartlynn opened 4 years ago

stuartlynn commented 4 years ago

We currently only have 2 types of operation on smooshr

  1. Combine columns together
  2. Create a taxonomy for a given column

In the future we would like to have more steps for example

Some of these steps will have dependencies on previous steps that are hard to predict at run time. It would be great to have each indiividual transform be defined as a node in a graph with dependecies linked by edges. Essentially a DAG.

This would inform the UI and the python code that is ultimetly spit out by the tool.

Some links to projects that might be worth looking at