digitalearthpacific / dep-coastlines

GNU General Public License v3.0
3 stars 1 forks source link

Resolve environments #12

Closed jessjaco closed 1 year ago

jessjaco commented 1 year ago

Here's the current workflow (putting here rather than readme as it may change):

  1. Calculate tides using src/calculate_tides.py
  2. Filter by tides, create annual mosaics, and calculate water indices using src/water_indices.py
  3. Threshold, clean, and vectorize using src/clean_rasters.py

Currently 1 is done locally using he dea-australia/coastlines based docker image. It can be run remotely, as long as the tidal model data is available. It hasn't yet been run in the notebook or kbatch environment.

2 is done using kbatch and the planetary computer docker image (as noted in the yaml) with the addition of dep_tools (which is installed in src/run_on_pc.sh).

3 is done using kbatch and the dea-australia/coastlines based docker image.

We will need to decide if we want to maintain these different dev environments or try to use a single one. Here are confounding issues:

jessjaco commented 1 year ago

Think I resolved this.

jessjaco commented 1 year ago

Solution was to just install gcc on the planetary computer image before installing dea_tools, so hdstats wouldn't error out (Sachin's solution above worked, but using the "official" dea_tools, which doesn't have hdstats).

My only complaint is this makes the docker image rather large. I'm considering maintaining the use of the vanilla planetary computer image when running through kbatch for step 2 above.