Closed btylerburton closed 12 months ago
Deployed prod image successfully using airflow standalone
Airflow is working, and we are seeing logs!
Draft PR is parked at https://github.com/GSA/datagov-harvester/pull/2
README and other cleanup TBD.
After much struggle with the Docker image, we pivoted to using the Python buildpack and were happily greeted with a healthy scheduler.
PR is here: https://github.com/GSA/datagov-harvester/pull/3
User Story
Datagovteam would like to implement a production quality airflow installation using the LocalExecutor. This is necessary in order to establish a baseline for Airflow performance in Cloud.gov, and will allow us to compare performance against other executors more quantitatively.
Related:
Acceptance Criteria
[ACs should be clearly demoable/verifiable whenever possible. Try specifying them using BDD.]
cf login -a api.fr.cloud.gov --sso
and authenticate AND I runcf target -o gsa-datagov -s <development|staging|production>
WHEN I runcf push airflow --vars-file my_vars_file --strategy rolling
THEN I expect to see the instance of the airflow admin panel when I visit the expected cloud.gov internal URL AND I will see my test DAG loaded here.Background
Given the overwhelming number of options for configuration, establishing a performance baseline with a LocalExecutor configured for production, with an external RDS DB, will allow us to benchmark other solutions more effectively.
Cloud.gov also supports building applications from images, and given that we work with containers locally and have been traditionally supporting a both an image-to-container solution for local developement and a buildpak-to-container solution for production, then this POC will attempt to unify those two environments.
Security Considerations (required)
[Any security concerns that might be implicated in the change. "None" is OK, just be explicit here!]
Sketch