Closed padilla410 closed 2 years ago
I end up with an error when trying to parse the files
Error in make.names(col.names, unique = TRUE): invalid multibyte string 5; debug with scmake("7a_temp_coop_munge/tmp/Iowa_DNR_LimnoProfiles_2000_2020.rds.ind", "7a_temp_co...
I will try to track down which file that is. Perhaps a Windows/Mac difference in one of the parsers or files?
Iowa_DNR_LimnoProfiles_2000-2020/Iowa_profiles_2017b/17129263ysi.csv causes the error for me.
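For anyone trying to track down a file like this, here is a minimal sketch of one way to do it, assuming the problem is bytes that are not valid UTF-8. The helper name `find_badly_encoded_files()` is hypothetical and not part of the pipeline; it only relies on base R's `readLines()` and `validUTF8()`.

```r
# Hypothetical helper (not part of the pipeline): return the paths of files
# that contain at least one line that is not valid UTF-8.
find_badly_encoded_files <- function(paths) {
  is_bad <- vapply(paths, function(path) {
    lines <- readLines(path, warn = FALSE)
    # validUTF8() is FALSE for any string containing invalid byte sequences
    !all(validUTF8(lines))
  }, logical(1))
  unname(paths[is_bad])
}

# Possible usage, assuming the raw CSVs have been unzipped locally:
# csv_files <- list.files("Iowa_DNR_LimnoProfiles_2000-2020",
#                         pattern = "\\.csv$", recursive = TRUE,
#                         full.names = TRUE)
# find_badly_encoded_files(csv_files)
```

This only flags candidate files; it does not say which encoding they actually use.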
It is addressed by adding fileEncoding to read.csv, i.e. read.csv(x, fileEncoding="latin1"), in the else branch of parse_2017_2020_data().
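If forcing latin1 on every file is a concern, one alternative is to fall back to it only when a file is not valid UTF-8. This is a rough sketch under that assumption, not the change that was actually made in parse_2017_2020_data(); the wrapper name `read_csv_checked()` and its `fallback` argument are made up for illustration.

```r
# Hypothetical wrapper (illustration only): read a CSV with the default
# encoding when the file is valid UTF-8, otherwise re-read it with an
# explicit fallback encoding such as latin1.
read_csv_checked <- function(path, fallback = "latin1", ...) {
  lines <- readLines(path, warn = FALSE)
  if (all(validUTF8(lines))) {
    read.csv(path, ...)
  } else {
    # fileEncoding makes read.csv() re-encode the input while reading
    read.csv(path, fileEncoding = fallback, ...)
  }
}
```

The trade-off is one extra pass over each file in exchange for not mis-decoding files that are already UTF-8.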
I just did a before/after with the new HTML and the old one. Iowa is on the map for PGDL now 🎉
OK! I resolved all of the comments and successfully ran scmake(7a_temp_coop_munge) locally (with options(scipiper.dry_put = TRUE)). I intend to do a run that talks to gdrive after I finish a few summary tasks for a 3 pm ET meeting today.
Data summary:
all_coop_dat_linked.feather (64% of the lakes in the raw data)
all_coop_dat_linked.feather (~81% of the records in the raw data)
I think the major reason for the loss here is the mismatch between the raw data and the existing lookup table. The existing lookup table has 132 lakes while the raw data has 176. We could easily pick this up if we had the rest of the spatial info.
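A quick way to see which lakes fall out at the linking step is to compare the lake identifiers in the raw data against the lookup table. The sketch below uses made-up data and assumed column names (`lake_name`, `site_id`); the real tables may be keyed differently.

```r
# Illustrative data only; the real tables and their column names may differ.
raw_dat <- data.frame(
  lake_name = c("Lake A", "Lake B", "Lake C", "Lake B"),
  temp = c(20.1, 18.4, 22.3, 17.9)
)
lookup <- data.frame(
  lake_name = c("Lake A", "Lake B"),
  site_id = c("id_1", "id_2")
)

raw_lakes <- unique(raw_dat$lake_name)

# Lakes present in the raw data but absent from the lookup table
unmatched_lakes <- setdiff(raw_lakes, lookup$lake_name)

# Share of lakes and share of records that can be linked
lake_coverage <- mean(raw_lakes %in% lookup$lake_name)
record_coverage <- mean(raw_dat$lake_name %in% lookup$lake_name)
```

Listing `unmatched_lakes` gives the set of lakes that would need spatial info added to the lookup table.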
A few checks:
the outputs:
Snapshot of 8_viz/out/lakes_summary_fig.html