dssg / appy-reviews

A "smart" Web application for reviewing DSSG program application submissions
MIT License
2 stars 1 forks source link

periodic automated ETL #61

Closed jesteria closed 2 years ago

jesteria commented 5 years ago

ETL should be run periodically by an automated process to (re)load data into the database.


In 2019, ETL was run during the application stage from the management CLI, with variations of the following:

manage etl --all --stage=application

…and in the review stage:

manage etl --all --stage=review

Note, however, that Appy does not ship with the repo's manage.py. The above CLI commands wrap and document the applicable underlying commands, (run through docker run --rm --user webapp -eDATABASE_URL -eWUFOO_API_KEY appyreviews_web):

./manage.py loadwufoo -v 1 --stage application -

./manage.py loadapps execute

and:

./manage.py loadwufoo -v 1 --stage review -

./manage.py loadapps --closed execute
jesteria commented 4 years ago

ETL is now documented here.

jesteria commented 4 years ago

This work was begun in 2020, with the intention of utilizing AWS Batch to execute ETL, using the existing Docker image. Provisioning of the periodic job(s) would be controlled by manage command(s).

See: https://github.com/dssg/appy-reviews/compare/jsl/auto-etl