calblueprint / ace-ny


[feat] Connecting NYISO and NYSERDA to Populate Database #16

Open itsliterallymonique opened 5 days ago

itsliterallymonique commented 5 days ago

This sprint, you will be connecting the data from the NYISO and NYSERDA to populate the database! For this, please make a separate Python file, and keep in mind that we will be adding the data we scraped from ORES to this function. Here is what you will need to do:

  1. Create a file that will hold the function(s) you will use to connect the NYISO and NYSERDA data by project name and/or interconnection_queue_number
  2. Adjust your NYSERDA web scraper to also collect interconnection_queue_number
  3. Create a function that connects all the NYISO and NYSERDA data by project name and/or interconnection_queue_number and writes it to a JSON or CSV file (whichever you prefer!)
     a. When you do this, check if the NYISO has any data that can help populate any NULL columns in the data we scraped from the NYSERDA
     b. If there is a project that is in the NYSERDA but not in the NYISO (or vice versa), still add it to the dataset
     c. We will need to assign each project its State Senate and Assembly Districts. Here’s how you can find the districts:
        • Find a free API that takes an address (you can get this from Google Maps reverse geocoding) and returns the state senate and assembly districts
        • Add the counties in NY and their corresponding state senate and assembly districts to our database, then use that data to assign the districts based on the project’s county
  4. Make a new test table on Supabase based on our “Projects” table and try to write all the data into it (to check your web scraper works so far!)
  5. For now keep key_development_milestones and image NULL (I’m still sorting some details out with these so you will add them in a later sprint!)
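
As a starting point for step 3, here is a minimal sketch of the merge logic using only the standard library. It assumes both scrapers return a list of dicts; apart from interconnection_queue_number, key_development_milestones, and image, the column names (project_name, county) and the sample rows are made up for illustration and may not match the real "Projects" table.

```python
import json


def normalize_name(name):
    """Lowercase and collapse whitespace so small formatting differences still match."""
    return " ".join(name.lower().split()) if name else None


def merge_projects(nyserda_rows, nyiso_rows):
    """Combine NYSERDA and NYISO rows into one list of project dicts.

    Matching tries interconnection_queue_number first, then the normalized
    project name. NYSERDA values are kept; NYISO only fills fields that are None.
    """
    by_queue, by_name = {}, {}
    for row in nyiso_rows:
        if row.get("interconnection_queue_number") is not None:
            by_queue[row["interconnection_queue_number"]] = row
        name = normalize_name(row.get("project_name"))  # assumed column name
        if name:
            by_name[name] = row

    merged = []
    matched = set()  # id() of every NYISO row that found a NYSERDA partner
    for row in nyserda_rows:
        match = by_queue.get(row.get("interconnection_queue_number"))
        if match is None:
            match = by_name.get(normalize_name(row.get("project_name")))
        combined = dict(row)
        if match is not None:
            matched.add(id(match))
            for key, value in match.items():
                if combined.get(key) is None:  # step 3a: fill NULL columns
                    combined[key] = value
        merged.append(combined)

    # Step 3b: keep projects that only appear in the NYISO data.
    for row in nyiso_rows:
        if id(row) not in matched:
            merged.append(dict(row))

    # Step 5: these two columns stay NULL for now.
    for project in merged:
        project.setdefault("key_development_milestones", None)
        project.setdefault("image", None)
    return merged


if __name__ == "__main__":
    # Made-up sample rows, only to show the shape of the input and output.
    nyserda = [{"project_name": "Example Solar",
                "interconnection_queue_number": None, "county": None}]
    nyiso = [{"project_name": "example  solar",
              "interconnection_queue_number": 55, "county": "Albany"}]
    print(json.dumps(merge_projects(nyserda, nyiso), indent=2))
```

Matching on the queue number first avoids false matches between projects with similar names; the name fallback only matters while NYSERDA rows are still missing interconnection_queue_number. Swapping json.dumps for csv.DictWriter would give the CSV variant.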

Some resources to help!

PR Reviewers: @itsliterallymonique and @ethan-tam33

deenasun commented 17 hours ago

Questions:

For NYISO: