gbif / pipelines

Pipelines for data processing (GBIF and LivingAtlases)
Apache License 2.0
40 stars 28 forks source link

#989 Oozie workflow for clustering #990

Closed timrobertson100 closed 10 months ago

timrobertson100 commented 10 months ago

This provides an Oozie workflow to run clustering and adds the steps to truncate the HBase table and load the HFiles in place.

timrobertson100 commented 10 months ago

Self merging after discussion with Nik