ccao-data / service-spark-iasworld

Service for extracting tables from the CCAO system-of-record and uploading them to the Data Department's data warehouse
GNU Affero General Public License v3.0
0 stars 0 forks source link

Enable extracting iasWorld test environment #13

Closed dfsnow closed 1 month ago

dfsnow commented 1 month ago

This PR modifies the Spark extractor to allow extracting from the iasWorld test environment. Previously, we only extracted from the production environment.

This setup is necessary to facilitate https://github.com/ccao-data/enterprise-intelligence/issues/131, which requires pulling data that is only loaded into the test environment.

Data pulled from the test environment is put into its own dedicated database in Athena, iasworld_test. Note that the test environment doesn't replicate the production environment; not all tables are available in test, and the most recent year of data is not present.

I've already successfully run this pipeline, so you can find the test data in Athena under iasworld_test.