wildlife-dynamics / ecoscope-workflows

An extensible task specification and compiler for local and distributed workflows.
BSD 3-Clause "New" or "Revised" License

Task: ETL - Column Mapping #34

Closed: walljcg closed 6 days ago

walljcg commented 1 week ago

We need a task that allows a user to:

  1. select which columns to retain from a dataframe
  2. define the order of the columns
  3. rename the columns

The input to this task function will likely be a dictionary of name-remapping key:value pairs (existing column name to new column name), with the order of the dict defining the column order and the set of keys defining which columns to retain.

The function should provide a warning when critical columns (e.g., 'geometry') are being dropped.
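
A minimal sketch of what such a task could look like, assuming a pandas-style dataframe. The function name `map_columns`, the `critical_columns` parameter, and the choice to raise on unknown keys are illustrative assumptions, not the project's actual API:

```python
from __future__ import annotations

import warnings
from typing import TYPE_CHECKING, Iterable, Mapping

if TYPE_CHECKING:  # pandas is only needed here for the type hints
    import pandas as pd


def map_columns(
    df: pd.DataFrame,
    mapping: Mapping[str, str],
    critical_columns: Iterable[str] = ("geometry",),  # hypothetical default
) -> pd.DataFrame:
    """Retain, reorder, and rename columns according to `mapping`.

    Keys are existing column names (they define what is kept and in what
    order); values are the new names.
    """
    # Assumption: an unknown key is treated as an error rather than ignored.
    missing = [col for col in mapping if col not in df.columns]
    if missing:
        raise KeyError(f"Columns not found in dataframe: {missing}")

    # Warn when a critical column exists in the input but is not retained.
    dropped = [
        col for col in critical_columns
        if col in df.columns and col not in mapping
    ]
    if dropped:
        warnings.warn(
            f"Dropping critical columns: {dropped}", UserWarning, stacklevel=2
        )

    # Selecting with a list keeps only those columns, in dict order;
    # rename then applies the old-name -> new-name pairs.
    return df[list(mapping)].rename(columns=dict(mapping))
```

For example, calling `map_columns(df, {"speed": "speed_kmh", "id": "subject_id"})` on a dataframe with columns `id`, `geometry`, and `speed` would return the columns `speed_kmh` and `subject_id`, in that order, and emit a warning that `geometry` is being dropped. To keep a column without renaming it, map it to itself (e.g. `{"geometry": "geometry"}`).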