Closed adyork closed 5 years ago
I have to do this for the ProteOMZ proteins dataset, so this will get done during my normal DM duties for https://www.bco-dmo.org/dataset/737620.
Done, works easily enough in dataflows. Nice regex functionality to specify which columns to flip. See notes in https://github.com/BCODMO/Proteomz-Saito/tree/DM_structure/737620/3#tips-learned for https://github.com/BCODMO/Proteomz-Saito/commit/9fec20be211b66357e3f45ce0aab0dcdca435fa5.
However, we don't need to use dataflows for this; pipelines has unpivot too.
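For anyone unfamiliar with what "flipping" columns by regex means here, a rough plain-Python sketch of the idea follows. This is not the dataflows or pipelines implementation, only an illustration of the operation; the function name `unpivot_rows` and the column names (`protein_id`, `depth_20m`, etc.) are made up for the example and are not from the ProteOMZ dataset.

```python
import re


def unpivot_rows(rows, pattern, key_name, value_name):
    """Turn every column whose name fully matches `pattern` into its own row.

    Hypothetical helper for illustration only; not a dataflows API.
    """
    column_re = re.compile(pattern)
    for row in rows:
        # Columns that do not match are carried over unchanged on every output row.
        fixed = {k: v for k, v in row.items() if not column_re.fullmatch(k)}
        for column, value in row.items():
            match = column_re.fullmatch(column)
            if match:
                # The first capture group becomes the value of the new key column.
                yield {**fixed, key_name: match.group(1), value_name: value}


# Made-up wide-format rows: one column per depth.
rows = [
    {"protein_id": "P1", "depth_20m": 3, "depth_100m": 7},
    {"protein_id": "P2", "depth_20m": 0, "depth_100m": 5},
]

# Long format: one row per (protein, depth) pair.
long_rows = list(unpivot_rows(rows, r"depth_(\d+)m", "depth_m", "count"))
```

The regex does two jobs: it selects which columns to unpivot, and its capture group supplies the value for the new key column, so a single pattern can cover many similarly named columns.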
Make a dataflow .py for pivot/unpivot and see if it will work for our identified use-cases.
If this works out we can add flows directly into the pipeline yamls.
Example custom flow called from a pipeline: https://github.com/frictionlessdata/datapackage-pipelines#dataflows-integration