usds / justice40-tool

A tool to identify disadvantaged communities due to environmental, socioeconomic and health burdens
https://screeningtool.geoplatform.gov/
Creative Commons Zero v1.0 Universal

As an open source contributor, I want to have a user guide for adding data sources, so that I know what steps to take when I want to submit a PR for a new data source. #1468

Open widal001 opened 2 years ago

widal001 commented 2 years ago

Description

There are many steps involved in adding a new data source to the ETL pipeline that populates the tool. It can be especially challenging to know which steps to take and to confirm that all of the necessary steps have been taken when submitting a PR for a new data source as an open source contributor.

Solution

To simplify the process of proposing code that adds a new data source to the tool, it would be useful to have a user guide that walks contributors through all of the steps required to develop an ETL class for that data source and to cover that code with the appropriate set of tests. It would be helpful to have this guide live directly in the data_pipeline/README.md or in a new data_pipeline/etl/README.md.

esfoobar-usds commented 2 years ago

Since a fundamental refactor of the score and dataset models using YAML is upcoming, we want to wait until #1417 is done.

widal001 commented 2 years ago

@esfoobar-usds Sounds good! Thanks for flagging that for me. I'll subscribe to updates on that refactor to make sure I'm clued into those changes, and wait until you all are done with that.

emma-nechamkin commented 2 years ago

ahh! sorry i didn't communicate this when we last spoke, @widal001 :(