Adjusted the pipeline where necessary and then built it. Also added 30_data_summarize to capture some of the summarization that is used in gages-through-ages but is also useful elsewhere.
This should fix #3. Below are some diagnostics about the pipeline artifacts from this particular build (week of June 1, 2020).
Number of combined daily flow observations, range of data, and breakdown of daily flow through time
```r
library(dplyr)    # %>%, mutate, group_by, summarize
library(scipiper) # sc_retrieve (assumed to come from scipiper)

nwis_all <- readRDS(sc_retrieve('20_data_munge/out/daily_flow.rds.ind'))
nrow(nwis_all)
#> [1] 233383058
range(nwis_all$date)
#> [1] "1857-02-01" "2020-06-05"

rm(nwis_dv, nwis_uv) # had to clear most other things bc it needed more memory

# Count daily flow observations per calendar year
nwis_all_summary <- nwis_all %>%
  mutate(year = as.numeric(format(date, "%Y"))) %>%
  group_by(year) %>%
  summarize(n_obs = n())
plot(nwis_all_summary$year, nwis_all_summary$n_obs)
```
Note: low point at end is because we are only mid-way through 2020
Number of active (>335 days) sites per year & year range of active data
Note: low point at end is because we are only mid-way through 2020
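The code behind the active-site diagnostic is not included here. A minimal sketch of one way to get these counts, assuming the daily flow table has a site identifier column (called `site_no` here, which is a hypothetical name) next to `date`, and that "active" means more than 335 distinct days of data in a calendar year. The toy data stands in for `nwis_all`:

```r
library(dplyr)

# Toy stand-in for the daily flow table; `site_no` is an assumed column name
toy_flow <- bind_rows(
  tibble(site_no = "A",
         date = seq(as.Date("2018-01-01"), as.Date("2019-12-31"), by = "day")),
  tibble(site_no = "B",
         date = seq(as.Date("2019-01-01"), as.Date("2019-06-30"), by = "day"))
)

# One row per site-year that has more than 335 distinct days of data
active_site_years <- toy_flow %>%
  distinct(site_no, date) %>%
  mutate(year = as.numeric(format(date, "%Y"))) %>%
  count(site_no, year, name = "n_days") %>%
  filter(n_days > 335)

# Number of active sites in each year
active_per_year <- count(active_site_years, year, name = "n_active")

# Year range covered by active data
range(active_site_years$year)
```

In this toy example site B has only half a year of data, so it never counts as active.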
Number of unique active sites through all time + number of continuous active sites & when the continuous sites were built (earliest year of continuous site data)
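The code for this diagnostic is also not shown. As a rough sketch only: it assumes a table with one row per active site-year (hypothetical columns `site_no` and `year`), and it assumes "continuous" means a site is active in every year from its first active year through the latest year in the data. The pipeline may define it differently.

```r
library(dplyr)

# Toy table of active site-years; column names are assumptions
active_site_years <- tibble(
  site_no = c("A", "A", "A", "B", "B", "C"),
  year    = c(2017, 2018, 2019, 2017, 2019, 2019)
)

# Unique sites that were active in at least one year
n_unique_active <- n_distinct(active_site_years$site_no)

last_year <- max(active_site_years$year)

# Assumed definition of continuous: no gap between first active year and last_year
continuous_sites <- active_site_years %>%
  group_by(site_no) %>%
  summarize(first_year = min(year),
            last_site_year = max(year),
            n_years = n_distinct(year)) %>%
  filter(last_site_year == last_year,
         n_years == last_year - first_year + 1)

nrow(continuous_sites)             # number of continuous active sites
table(continuous_sites$first_year) # when the continuous sites were built
```

Here site B is dropped because it has a gap in 2018, while A and C are continuous from their first active year.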
Number of DV observations and unique sites pulled
Number of UV observations and unique sites pulled