jordansread / hyperscales-data-release

data release repo for NE CASC funded hyperscales project
GNU General Public License v3.0
0 stars 1 forks source link

Multistate release changes needed following reviews #26

Closed lindsayplatt closed 3 years ago

lindsayplatt commented 3 years ago

Overall

1_spatial

2_config

3_observations

4_inputs

5_predictions

6_habitat

jordansread commented 3 years ago

Let's also make sure we cite LAGOS for depth data if we aren't doing that already.

lindsayplatt commented 3 years ago

In morphometry: "some depth data came from [LAGOS CITE]" Add to 2_config cross sites. Ask Sam Oliver

jordansread commented 3 years ago

For this

Update naming conventions declared in 4inputs for NLDAS files (inside inputs zips). They do not currently match. They look like NLDAS_time[0.359420]_x[284]_y[164].csv but the metadata says they follow nldas_meteo_N{minN-maxN}_W{minW-maxW}.csv. Similarly, the 1_spatial lake_metadata.csv file column called meteo_filename follows the metadata convention but doesn’t match the actual filenames stored in the inputs zips. @jread-usgs find the function from mntoha that rewrites these filenames and apply here.

I adjusted the file names in the metadata file to match the actual file names (the time range was off, which is an issue we had elsewhere too) and uploaded that file. Then I modified the metadata yaml text which was copy-pasta from mntoha, which had updated templates for the meteo file names that we're not using here. That should take care of this one. The xml was manually uploaded. The csv for the metadata was uploaded w/ scmake (I didn't what to scmake more than that since this repo has connections to the lake-temperature-model-prep repo, and we recently changed max depth values in some of those lakes).

jordansread commented 3 years ago

For

investigate why “nhdhr_59746189” has a pb0 nml file but does not appear in the pb0_config.json file. @jread-usgs to look into this. Maybe a model we created a config for but ultimately failed because it couldn't be run. Remove the nml file for this site (see bad_segfault_lake target)

I removed this file from the zip and got it back up on sciencebase. So now that lake isn't included. Verified the lake counts match our expectations now.

lindsayplatt commented 3 years ago

Re FGDC missing geoform issue. While the meddle template does have this templated for larger-cites, it is missing it for cross-sites. When we get this error for the larger-cites it is because we are truly missing the form: field (see here).

To fix:

  1. Update meddle so that cross-cites also has form.
  2. Add form: to ymls for larger-cites
lindsayplatt commented 3 years ago

I can update some but not all SB metadata items because some are still tied to the other data files.

@jread-usgs will need to complete 01 through 05 but will need to make sure that my general metadata changes are added to the pipeline (so blocked until this gets merged) & meddle gets updated first (so blocked until this is merged).