edanalytics / edu_edfi_airflow

Manages extract-load of Ed-Fi data in Airflow
Other
4 stars 0 forks source link

Feature/earthbeam student ID package #58

Closed sleblanc23 closed 1 month ago

sleblanc23 commented 1 month ago

Description & motivation

This PR adds the ability to use the new student ID earthmover package to the EarthbeamDAG. It adds three optional tasks to the file_to_edfi_taskgroup:

It also modifies the run_earthmover task to add the parameters required for the student ID bundle if needed. It is currently in draft status while testing continues.

Usage

Three new and three existing task-group level args are required to use the student ID features, and one is optional:

Because the student ID process uses project composition, earthmover deps will need to be run to install the packages. More details on this to come in a migration guide. For a preview, see the testing instructions linked below.

Breaking Changes

The argument database_conn_id is renamed to snowflake_read_conn_id for clarity. It also creates an earthmover parameter called SNOWFLAKE_CONNECTION instead of DATABASE_CONNECTION to align with the student ID bundle.

PR Merge Priority:

High, once testing is complete

Tests and QC done:

This has been successfully tested in GSN and testing is in progress in TX. Instructions for testing can be found here.