intelligent-environments-lab / bleed-orange-measure-purple

Data and Analysis related to the Bleed Orange, Measure Purple low-cost sensor project on UT's campus
GNU General Public License v3.0

Adding TCEQ rows to PurpleAir Heatmap #51

Closed calvinjlin closed 4 years ago

calvinjlin commented 4 years ago

Hagen mentioned it might be good to add the three rows of TCEQ data to the heatmap.

Primary Objective: Add TCEQ data rows to the heatmap

Difficulty: low to medium

Steps

  1. Assess the state of TCEQ PM2.5 data

    • [x] Is it in parquet format?
    • [x] Is it at the same frequency as the data in the destination script, or does the script have an easy resampling function?
    • [x] Will a pivot be necessary?
    • [x] Is the data in Central Time?
  2. Import data into script and process it.

  3. Combine or append to PurpleAir data frame

  4. Plot and save.
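
The steps above could be sketched roughly as follows. This is only an illustration on synthetic data, not the project's actual script: the column names (`timestamp`, `site`, `pm25`), the site and sensor labels, the UTC source timezone, the hourly target frequency, and the `plot_heatmap` helper are all assumptions. In practice the TCEQ frame would presumably come from the parquet file rather than being built by hand.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for the TCEQ PM2.5 data (hypothetical layout):
# long format, 30-minute readings in UTC for three sites.
utc_index = pd.date_range("2020-07-01 06:00", periods=12, freq="30min", tz="UTC")
frames = []
for i, site in enumerate(["tceq_a", "tceq_b", "tceq_c"]):
    frames.append(
        pd.DataFrame(
            {"timestamp": utc_index, "site": site, "pm25": np.arange(12.0) + 10 * i}
        )
    )
tceq_long = pd.concat(frames, ignore_index=True)

# Pivot so that each TCEQ site becomes its own column.
tceq_wide = tceq_long.pivot(index="timestamp", columns="site", values="pm25")

# Convert the index to Central Time, then resample to hourly means
# so the frequency matches the (assumed hourly) PurpleAir frame.
tceq_wide = tceq_wide.tz_convert("America/Chicago")
tceq_hourly = tceq_wide.resample("60min").mean()

# Synthetic stand-in for the PurpleAir frame: hourly, already in Central Time.
pa_index = pd.date_range(
    "2020-07-01 01:00", periods=6, freq="60min", tz="America/Chicago"
)
purpleair = pd.DataFrame(
    {"pa_1": np.linspace(5.0, 10.0, 6), "pa_2": np.linspace(6.0, 11.0, 6)},
    index=pa_index,
)

# Combine column-wise; rows are aligned on the shared datetime index.
combined = pd.concat([purpleair, tceq_hourly], axis=1)

# Heatmap layout: one row per sensor, one column per timestamp.
# Dates are formatted as strings so the axis labels stay readable.
heatmap_data = combined.T
heatmap_data.columns = heatmap_data.columns.strftime("%m-%d %H:%M")


def plot_heatmap(data, path="heatmap.png"):
    """Hypothetical helper: draw the heatmap with seaborn and save it."""
    import matplotlib

    matplotlib.use("Agg")  # render without a display
    import matplotlib.pyplot as plt
    import seaborn as sns

    fig, ax = plt.subplots(figsize=(10, 3))
    sns.heatmap(data, ax=ax, cmap="viridis")
    fig.savefig(path, bbox_inches="tight")
    plt.close(fig)
```

Calling `plot_heatmap(heatmap_data)` would then write the figure to disk. The plotting imports are kept inside the helper so the data preparation can be checked without seaborn installed.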

Optional objectives: Identify and refine the data pipeline for TCEQ

calvinjlin commented 4 years ago

Notes

  1. ValueError raised when using pd.concat.
  2. The TCEQ data pipeline averages both sensors into a single column; should this be undone or left as is?
  3. A better data structure is needed for TCEQ data, which may have different parameters per sensor (how to combine it into one file or make it more compatible with the PurpleAir data?).
  4. For the future, consider moving from seaborn to plotly for the heatmap, since it allows formatting of pandas datetimes (I wouldn't have to downgrade dates to string format to make it look good).
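
One possible answer to notes 2 and 3, sketched under assumptions: store the TCEQ data in long format, with one row per timestamp, site, sensor, and parameter, so sensors with different parameters fit in one file, and the per-sensor values are kept until an average is actually wanted. The column names, site and sensor labels, and values below are made up for illustration and are not taken from the project's data.

```python
import pandas as pd

# Hypothetical long-format TCEQ records: sensor B reports an extra parameter.
records = [
    {"timestamp": "2020-07-01 01:00", "site": "site_1", "sensor": "A", "parameter": "pm25", "value": 8.0},
    {"timestamp": "2020-07-01 01:00", "site": "site_1", "sensor": "B", "parameter": "pm25", "value": 10.0},
    {"timestamp": "2020-07-01 01:00", "site": "site_1", "sensor": "B", "parameter": "ozone", "value": 31.0},
    {"timestamp": "2020-07-01 02:00", "site": "site_1", "sensor": "A", "parameter": "pm25", "value": 9.0},
    {"timestamp": "2020-07-01 02:00", "site": "site_1", "sensor": "B", "parameter": "pm25", "value": 11.0},
    {"timestamp": "2020-07-01 02:00", "site": "site_1", "sensor": "B", "parameter": "ozone", "value": 29.0},
]
tceq = pd.DataFrame(records)
tceq["timestamp"] = pd.to_datetime(tceq["timestamp"]).dt.tz_localize("America/Chicago")

# Select only the parameter the heatmap needs.
pm25 = tceq[tceq["parameter"] == "pm25"]

# Option 1: keep both sensors as separate columns (one heatmap row each).
per_sensor = pm25.pivot_table(index="timestamp", columns=["site", "sensor"], values="value")
per_sensor.columns = ["_".join(col) for col in per_sensor.columns]

# Option 2: average the sensors at each site into one column,
# which mirrors what the current pipeline is described as doing.
site_mean = pm25.pivot_table(index="timestamp", columns="site", values="value", aggfunc="mean")
```

Either wide frame has a timezone-aware datetime index with unique labels, which should make it easier to line up with the PurpleAir frame before concatenating.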